Figure 1

CGAGAGACGACAGAAGGTTACGGCTGCGAGAAGACGACAGAAGGGTCCAGAAAAA 

GGAAAGTGCTGGAGGGGAGTGGGGACAAAAGCAGCGACCAAGTGAATGTCACTTC 

AGTGACTGAGGCCAGGCAAAACGCGCGGGAAGGATTTTGTGTAGCTTGGGACCCTT 

TCATAGACACTGATGACACGTTTACGCAAAATAGAAATTTGAGGAGAAACGCCTGG 

GCCTTCGGAAAGGAGTGATTGATTAGTACTTGCAAGTTTAGGTGACTTTAAGGAGAA

CTAACTAATGTATACTATTGAGGGAGGAGGAAGAGCATTACAGAGTTTCCAGCAGC 

AGCAGGAAAGCTTTGGTTAATTTGGAAATGGATGATAGCATTAAAATAACAGAAGC 

GCCTCCAGGTCTCTGAAGCTTCAGTCCCCCAGCTGAAAGCCAGAAAAGACTAAGCC 

CACTAAGCCTTTTGATCCCTTTGGAAGCAAAGAACTTTCCTTCCCTGGGGTGAAGAC

TCTCCTCAGAAGATTTCCTGTCTCTGCCTATGTTACAAGAGGAATCAAAACCAAGAC 

AGAAGAGCTCAGGATGCAGGTGAGAGGCAGGGAAGTCAGCGGCTTGTTGATCTCCC 

TCACTGCTGTCTGCCTGGTGGTCACCCCTGGGAGCAGGGCCTGTCCTCGCCGCTGTG 

CCTGCTATGTGCCCACAGAGGTCTCACTGTACATTTCGGTACCTGACCTCCATCCCAG 

ATGGCATCCCGGCCAATGTGGAACGAATAAATTTAGGATATAACAGCCTTACTAGAT 

TGACAGAAAACGACTTTGATGGCCTGAGCAAACTGGAGTTACTCATGCTGCACAGT 

AATGGCATTCACAGAGTCAGTGACAAGACCTTCTCGGGCTTGCAGTCCTTGCAGGTC 

TTAAAAATGAGCTATAACAAAGTCCAAATCATTCGGAAGGATACTTTCTACGGACTC 

GGGAGCTTGGTCCGGTTGCACCTGGATCACAACAACATTGAATTCATCAACCCTGAG 

GCCTTTTATGGACTTACCTCGCTCCGCTTGGTACATTTAGAAGGAAACCGGCTCACA

AAGCTCCATCCAGACACATTTGTCTCATTAAGCTATCTCCAGATATTTAAAACCTCTT 

TCATTAAGTACCTGTTCTTGTCTGATAACTTCCTGACCTCCCTCCCAAAAGAAATGGT 

CTCCTACATGCCAAACCTAGAAAGCCTGTATTTGCATGGAAACCCATGGACCTGTGA 

CTGCCATTTAAAGTGGTTGTCTGAGTGGATGCAGGGAAACCCAGATATAATAAAAT 

GCAAGAAAGACAGAAGCTCTTCCAGTCCTCAGCAATGTCCCCTTTGCATGAACCCCA 

GGATCTCTAAAGGCAGACCCTTTGCTATGGTACCATCTGGAGCTTTCCTATGTACAA

AGCCAACCATTGATCCATCACTGAAGTCAAAGAGCCTGGTTACTCAGGAGGACAAT 

GGATCTGCCTCCACCTCACCTCAAGATTTCATAGAACCCTTTGGCTCCTTGTCTTTGA 

ACATGACANANNTNTCTGGAAATAAGGCCGACATGGTCTGTAGTATCCAAAAGCCA 

TCAAGGACATCACCAACTGCATTCACTGAAGAAAATGACTACATCATGCTAAATGC 

GTCATTTTCCACAAATCTTGTGTGCAGTGTAGATTATAATCACATCCAGCCAGTGTG 

GCAACTTCTGGCTTTATACAGTGACTCTCCTCTGATACTAGAAAGGAAGCCCCAGCT 

TACCGAGACTCCTTCACTGTCTTCTAGATATAAACAGGTGGCTCTTAGGCCTGAAGA 

CATTTTTACCAGCATAGAGGCTGATGTCAGAGCAGACCCTTTTTGGTTCCAACAAGA 

AAAAATTGTCTTGCAGCTGAACAGAACTGCCACCACACTTAGCACATTACAGATCCA

GTTTTCCACTGATGCTCAAATCGCTTTACCAAGGGCGGAGATGAGAGCGGAGAGAC 

TCAAATGGACCATGATCCTGATGATGAACAATCCCAAACTGGAACGCACTGTCCTGG 

TTGGCGGCACTATTGCCCTGAGCTGTCCAGGCAAAGGCGACCCTTCACCTCACTTGG 

AATGGCTTCTAGCTGATGGGAGTAAAGTGAGAGCCCCTTACGTTAGCGAGGATGGG 
CGAATCCTAATAGACAAAAATGGGAAGTTGGAACTGCAGATGGCTGACAGCTTTGA

TGCAGGTCTTTACCACTGCATAAGCACCAATGATGCAGATGCGGATGTTCTCACATA

CAGGATAACTGTGGTAGAGCCCTATGGAGAAAGCACACATGACAGTGGAGTCCAGC 

ACACAGTGGTTACGGGTGAGACGCTCGACCTTCCATGCCTTTCCACGGGTGTTCCAG

ATGCTTCTATTAGCTGGATTCTTCCAGGGAACACTGTGTTCTCTCAGCCATCAAGAG 

ACAGGCAAATTCTTAACAATGGGACCTTAAGAATATTACAGGTTACGCCAAAAGAT 

CAAGGTCATTACCAATGTGTGGCTGCCAACCCATCAGGGGCCGACTTTTCCAGTTTT

AAAGTTTCAGTTCAAAAGAAAGGCCAAAGGATGGTTGAGCATGACAGGGAGGCAG 

GTGGATCTGGACTTGGAGAACCCAACTCCAGTGTTTCCCTTAAGCAGCCAGCATCTT 

TGAAACTCTCTGCATCAGCTTTGACAGGGTCAGAGGCTGGAAAACAAGTCTCCGGTG 

TACATAGGAAGAACAAACATAGAGACTTAATACATCGGCGGCGTGGGGATTCCACG 

CTCCGGCGATTCAGGGAGCATAGGAGGCAGCTCCCTCTCTCTGCTCGGAGAATTGAC 

CCGCAACGCTGGGCAGCACTTCTAGAAAAAGCCAAAAAGAATTCTGTGCCAAAAAA 

GCAAGAAAATACCACAGTAAAGCCAGTGCCACTGGCTGTTCCCCTCGTGGAACTCA

CTGACGAGGAAAAGGATGCCTCTGGCATGATTCCTCCAGATGAAGAATTCATGGTTC 

TGAAAACTAAGGCTTCTGGTGTCCCAGGAAGGTCACCAACTGCTGACTCTGGACCAG

TAAATCATGGTTTTATGACGAGTATAGCTTCTGGCACAGAAGTCTCAACTGTGAATC 

CACAAACACTACAATCTGAGCACCTTCCTGATTTCAAATTATTTAGTGTAACAAACG 

GTACAGCTGTGACAAAGAGTATGAACCCATCCATAGCAAGCAAAATAGAAGATACA 

ACCAACCAAAACCCAATCATTATCTTTCCATCAGTAGCTGAAATTCGAGATTCTGCT 

CAGGCAGGAAGAGCATCTTCCCAAAGTGCACACCCTGTAACAGGGGGAAACATGGC 

TACCTATGGCCATACCAACACATATAGTAGCTTTACCAGCAAAGCCAGTACAGTCTT 

GCAGCCAATAAATCCAACAGAAAGTTATGGACCTCAGATACCTATTACAGGAGTCA 

GCAGACCTAGCAGTAGTGACATCTCTTCTCACACTACTGCAGACCCTAGCTTCTCCA 

GTCACCCTTCAGGTTCACACACCACTGCCTCGTCTTTATTTCACATTCCTAGAAACAA

CAATACAGGTAACTTCCCCTTGTCCAGGCACTTGGGAAGAGAGAGGACAATTTGGA 

GCAGAGGGAGAGTTAAAAACCCACATAGAACCCCAGTTCTCCGACGGCATAGACAC 

AGGACTGTGAGGCCAGCAATCAAGGGACCTGCTAACAAAAATGTGAGCCAAGTTCC 

AGCCACAGAGTACCCTGGGATGTGCCACACATGTCCTTCCGCAGAGGGGCTCACAG 

TGGCTACTGCAGCACTGTCAGTTCCAAGTTCATCCCACAGTGCCCTCCCCAAAACTA 

ATAATGTTGGGGTCATAGCAGAAGAGTCTACCACTGTGGTCAAGAAACCACTGTTAC 

TATTTAAGGACAAACAAAATGTAGATATTGAGATAATAACAACCACTACAAAATAT 

TCCGGAGGGGAAAGTAACCACGTGATTCCTACGGAAGCAAGCATGACTTCTGCTCC 

AACATCTGTATCCCTGGGGAAATCTCCTGTAGACAATAGTGGTCACCTGAGCATGCC 

TGGGACCATCCAAACTGGGAAAGATTCAGTGGAAACAACACCACTTCCCAGCCCCC 

TCAGCACACCCTCAATACCAACAAGCACAAAATTCTCAAAGAGGAAAACTCCCTTG 

CACCAGATCTTTGTAAATAACCAGAAGAAGGAGGGGATGTTAAAGAATCCATATCA 

ATTCGGTTTACAAAAGAACCCAGCCGCAAAGCTTCCCAAAATAGCTCCTCTTTTACC 

CACAGGTCAGAGTTCCCCCTCAGATTCTACAACTCTCTTGACAAGTCCGCCACCAGC 

TCTGTCTACAACAATGGCTGCCACTCAGAACAAGGGCACTGAAGTAGTATCAGGTG 
CCAGAAGTCTCTCAGCAGGGAAGAAGCAGCCCTTCACCAACTCCTCTCCAGTGCTTC

CTAGCACCATAAGCAAGAGATCTAATACATTAAACTTCTTGTCAACGGAAACCCCCA 

CAGTGACAAGTCCTACTGCTACTGCATCTGTCATTATGTCTGAAACCCAACGAACAA 

GATCCAAAGAAGCAAAAGACCAAATAAAGGGGCCTCGGAAGAACAGAAACAACGC 

AAACACCACCCCCAGGCAGGTTTCTGGCTATAGTGCATACTCAGCTCTAACAACAGC 

TGATACCCCCTTGGCTTTCAGTCATTCCCCACGACAAGATGATGGTGGAAATGTAAG 

TGCAGTTGCTTATCACTCAACAACCTCTCTTCTGGCCATAACTGAACTGTTTGAGAA

GTACACCCAGACTTTGGGAAATACAACAGCTTTGGAAACAACGTTGTTGAGCAAAT 

CACAGGAGAGTACCACAGTGAAAAGAGCCTCAGACACACCACCACCACTCCTCAGC 

AGTGGGGCGCCCCCAGTGCCCACTCCTTCCCCACCTCCTTTTACTAAGGGTGTGGTT

ACAGACAGCAAAGTCACATCAGCTTTCCAGATGACGTCAAATAGAGTGGTCACCAT 

ATATGAATCTTCAAGGCACAATACAGATCTGCAGCAACCCTCAGCAGAGGCTAGCC 

CCAATCCTGAGATCATAACTGGAACCACTGACTCTCCCTCTAATCTGTTTCCATCCAC

TTCTGTGCCAGCACTAAGGGTAGATAAACCACAGAATTCTAAATGGAAGCCCTCTCC 

CTGGCCAGAACACAAATATCAGCTCAAGTCATACTCCGAAACCATTGAGAAGGGCA 

AAAGGCCAGCAGTAAGCATGTCCCCCCACCTCAGCCTTCCAGAGGCCAGCACTCAT 

GCCTCACACTGGAATACACAGAAGCATGCAGAAAAGAGTGTTTTTGATAAGAAACC

TGGTCAAAACCCAACTTCCAAACATCTGCCTTACGTCTCTCTACCTAAGACTCTATTG 

AAAAAGCCAAGAATAATTGGAGGAAAGGCTGCAAGCTTTACAGTTCCAGCTAATTC 

AGACGTTTTTCTTCCTTGTGAGGCTGTTGGAGACCCACTGCCCATCATCCACTGGACC

AGAGTTTCATCAGGANTTGAAATATCCCAAGGGACACAGAAAAGCCGGTTCCACGT 

GCTTCCCAATGGCACCTTGTCCATCCAGAGGGTCAGTATTCAGGACCGTGGACAGTA 

CCTGTGCTCTGCATTTAATCCACTGGGCGTAGACCATTTTCATGTCTCTTTGTCTGTG

GTTTTTTACCCGGCAAGGATTTTGGACAGACATGTCAAGGAGATCACAGTTCACTTT 

GGAAGTACTGTGGAACTAAAGTGCAGAGTGGAGGGTATGCCGAGGCCTACGGTTTC 

CTGGATACTTGCAAACCAAACGGTGGTCTCAGAAACGGCCAAGGGAAGCAGAAAGG 

TCTGGGTAACACCTGATGGAACATTGATCATCTATAATCTGAGTCTTTATGATCGTG

GTTTTTACAAGTGTGTGGCCAGCAACCCATCTGGCCAGGATTCACTGTTGGTTAAGA 

TACAAGTCATCACAGCTCCCCCTGTCATTATAGAGCAAAAGAGGCAAGCCATCGTTG 

GGGTTTTAGGTGGAAGTTTGAAACTGCCCTGCACTGCAAAAGGAACTCCCCAGCCTA 

GTGTTCACTGGGTCCTTTATGATGGGACTGAACTAAAACCATTGCAGTTGACTCATT 

CCAGATTTTTCTTGTATCCAAATGGAACTCTGTATATAAGAAGCATCGCTCCTTCAGT

GAGGGGCACTTATGAGTGCATTGCCACCAGCTCCTCAGGCTCAGAGAGAAGGGTAG 

TGATTCTTACTGTGGAAGAGGGAGAGACAATCCCCAGGATAGAAACTGCCTCTCAG 

AAATGGACTGAGGTGAATTTGGGTGAGAAATTACTACTGAACTGCTCAGCTACTGG

GGATCCAAAGCCTAGAATAATCTGGAGGCTGCCATCCAAGGCTGTCATCGACCAGT 

GGCACAGAATGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGGTGGTTGGG 

TCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCAAGAAACAAAATGGG 

AGATGACCTAGTCCTGATGCATGTCCGCCTGAGATTGACACCTGCCAAAATTGAACA 

GAAGCAGTATTTTAAGAAGCAAGTGCTCCATGGGAAAGATTTCCAAGTTGACTGCA 
TCAACAATGTAGCCCAAGCTGATGACAGTGGCTATAGGACCAAGAGGTACACCCTT

TTCCACAATGGAACCTTGTATTTCAACAACGTTGGGATGGCAGAGGAAGGAGATTAT 

ATCTGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAACAGT 

TCTAACAGCCATCCCACGGATAAGGCAAAGCTACAAGACCACCATGAGGCTCAGGG 

CTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGAACCGAAGCCCAATGTA 

TTTTGGTTGCTGCCTTCCAACAATGTCATTTCATTCTCCAATGACAGGTTCACATTTC

ATGCCAATAGAACTTTGTCCATCCATAAAGTGAAACCACTTGACTCTGGGGACTATG 

TGTGCGTAGCTCAGAATCCTAGTGGGGATGACACTAAGACATACAAACTGGACATT 

GTCTCTAAACCTCCATTAATCAATGGCCTGTATGCAAACAAGACTGTTATTAAAGCC

ACAGCCATTCGGCACTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATCT

TCCCAGGTCACGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATACTTTGGA

AGCAGAGTCACGGTCCATCCAAATGGAACCTTGGAGATGAGGAACATCCGGCTTTC 

TGACTCTGCGGACTTCACCTGTGTGGTTCGGAGCGAGGGAGGAGAGAGTGTGTTGGT 

AGTGCAGTTAGAAGTCCTAGAAATGCTGAGAAGACCAACATTCAGAAACCCATTCA 

ACGAAAAAGTCATCGCCCAAGCTGGCAAGCCCGTAGCACTGAACTGCTCTGTGGAT 

GGGAACCCCCCACCTGAAATTACCTGGATCTTACCTGACGGCACACAGTTTGCTAAC

AGACCACACAATTCCCCGTATCTGATGGCAGGCAATGGCTCTCTCATCCTTTACAAA 

GCAACTCGGAACAAGTCAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGCTA 

CATCGAGAAACTCATCCTGTTAGAGATTGGGCAGAAGCCAGTCATTCTGACATACGA 

ACCAGGGATGGTGAAGAGCGTCAGTGGGGAACCGTTATCACTGCATTGTGTGTCTG 

ATGGGATCCCCAAGCCAAATGTCAAGTGGACTACACCGGGTGGCCATGTAATCGAC 

AGGCCTCAAGTGGATGGAAAATACATACTGCATGAAAATGGCACGCTGGTCATCAA 

AGCAACAACAGCTCACGACCAAGGAAATTATATCTGTAGGGCTCAAAACAGTGTTG 

GCCAGGCAGTTATTAGCGTGTCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAA 

ACTACCTACCCAGGAACATGCTCAGGAGGACAGGGGAAGCCATGCAGCTCCACTGT 

GTGGCCTTGGGAATCCCCAAGCCAAAAGTCACCTGGGAGACGCCAAGACACTCCCT 

GCTCTCAAAAGCAACAGCAAGAAAACCCCATAGAAGTGAGATGCTTCACCCACAAG 

GTACGCTGGTCATTCAGAATCTCCAAACCTCGGATTCCGGAGTCTATAAGTGCAGAG 

CTCAGAACCTACTTGGGACTGATTACGCAACAACTTACATCCAGGTACTCTGACAGG

AAGGGGGAGACTAAAATTCAACAGAAGTCCACATCCACAGGGTTTATTTTTTGGAA 

GAAGTTTAATCAAAGGCAGCCATAGGCATGTAAATGAGTCTGAATACATTTACAGT 

ATTAAATTTACAATGGACATGCGATGAGACTTGTAAATGAAAGCATTGTGAACTGA 

AACCGAGTCTCTGTGGATCTCAAAGCAAACTCTTAACTrAAGGCACmTGATTTTGC 

CAACAAATAATAACAAACATTAAGAGAAAAAAATGATCCACTACGAAATAACAAAC 

GGCTAATGCACCTGAATTCTCAGTAAAAAGACCTTTCTCTCGCTAACAGTTGCCAGC 

TGCCTCGTGTCTGTTTCCTACCAATGTCACAAACATCGCACACAGGGTGAATGGAGT 

CAACGGGAAAGATTAAGTTTGCGGTCTGTGTAAATCTCAATGTACAAATATTCTGTC 

NCTGGTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAAAAAAAAAAAAAA 

A AAA (SEQ ID NO: 1) 
Figure 3



mqvrgrevsgllisltavclvvtpgsracprrcacyvptevhctfryltsipdgipanve 

rinujynsltrltendfdgi^klellmlhsngmvsdktfsglqslqvlkmsynkvqn 

rkdtfyglgslvrli1ldhnniefinpeafygltslrlvhlegnrltklhpdtfvslsylqi 

fktsfikyij^dnfltslpkemvsympm^lyi^gnpwtcix:hlkwijsew^ 

ikckkdrsssspqqcpix;mnpriskgrpfamvpsgaflctkptidpslkskslvtqedngs 

astspqdfiepfgsl^l^mtxxsgnkadmvcsiqkpsrtsptaftee^^dyimlnasfstnl 

vcsvdynfflqpvwqllalysdsplilerkpqltetpslssrykqvalrpediftseadvr 

adpl^fqqekjvlqlnrtatllsllqiqfstdaqialpraentraerlkwtmilmmnnpk 

lertvlvggtiai^cpgkgdpsphlewixadgskvrapyvsedgrmrok^gklelqma 

dsfdaglyhcistndadadvltyritvvepygestifflsgvqhtvvtgetldlpclstgv 

PDASISWlLPGNTVFSQPSRDRQILhnsrGTmiIX2VTPKDQGHYQCVAANPSGADFSSFKV 

SVQKKGQRMVEHDREAGGSGLGEPNSSVSLKQPASLKLSASALTGSEAGKQVSGVHRK 

NKliRDLlHRRRGDSTLRRF^EFiRRQLPl^ARRroPQRWAALLEKAKXNSWKKQENTTV 

KPWLAVPLVELTDEEKDASGMIPPDEEFMVLKTKASGVPGRSPTADSGPVNHGFMrSI 

ASGTEVSTVWQTLQSEHLPDFKIJSVTNGTAWKSMNPSIASKIEDTTNQNPniFPSVAE 

IRDSAQAGRASSQSAHPVTGGNMATYGHTNTYSSFTSKASTVLQPINPTESYGPQIPrrGV 

SRPSSSDISSHTTADPSFSSHPSGSITITASSLFHIPRNNNTGNFPLSRHLGRERTIWSRGRV 

KNPHRTP\HLRRHRHRTVRPA1KGPANKNVSQWATEYPGMCHTCPSAEGLTVATAALS 

VPSSSHSALPKTNNVGVIAEESTTVVKJ<PLLLFKD^ 

ASMTSAPTSVSLGKSP\TDNSGH1^MPGTIQTGKDSVETTPLPSPI^TPSIPTSTKFSKRKTP 

LHQJFVNNQKJCEGMLKKPYQFGLQKNPAAKLPKIAPLLPTGQSSPSDSTTLLTSPPPAL^ 

TMAATQNKGTEVVSGARSLSAGKKQPFTNSSPVLPSTISKRSOTLNFLSTETPTVTSPTAT 

ASVIMSETQRTRSKEAKDQIKGPRKNRNNANTTPRQVSGYSAYSALTTADTPLAFSHSP 

RQDDGGNVSAVAYHSTTSLLAITEU^EKYTQTLGNTTALETTLLSKSQESTTVKRASDTP 

PPIJ^SGAPPVPTPSPPPFTKGVVTDSKVTSAFQMTSNRVWIYF^SRHNTDLQQPSAEAS 

PNPEITTGTTDSPSM.FPSTSWALRVDKPQNSKWKPSPWPEHKYQLKSYSETEKGKJIPA 

VSMSPHI^LPEASTHASHW^^^QKHAEKSVFDKJa > GQ^OTSKHLPYVSLPKTLLK^ 

GKAASFTWANSDVFLPCEAVGDPLPIIHWTRVSSGXEISQGTQKSRFHVLPNGTLSIQRV 

SIQDRGQYLCSAFNPLGVDHFHVSLSVVFYPARILDRHVKEITVHFGSTVELKCRVEGMP 

RPTVSWILANQTVVSETAKGSRKVWVTPDGTLIIYNLSLYDRGFYKCVASNPSGQDSLL 

VKIQVITAPPVIffiQKRQAIVGVLGGSLKLPCTAKGTPQPSVHWVLYDGTELKPLQLTHS 

RFPLYPNGTLYmSIAPSWGTYECIATSSSGSERRVVILTVEEGETIPRIETASQKWTEVN 

LGEKLLLNCSATGDPKPRIIWRLPSKAVIDQWHRMGSRIHVYPNGSLVVGSVTEKDAGD 

Y1X;VARNKMGDDLVLMHVRLRLTPAKIEQKQYFKKQVLHGKDFQVDCKASGSPVPEV 

SWSLPDGTVLNNVAQADDSGYRTKRYTLFHNGTLYFNNVGMAEEGDYICSAQNTLGK 

DEMKVHLT\O.TAIPRIRQSYKTTMRLRAGETA\a.DCEVTGEPKPNVFWLLPSNNVISFS 

NDRFTFHANRTLSniKVKPLDSGDYVCVAQNPSGDDTKTYKLDIVSKPPLINGLYANKT 

VIKATAIRHSKKYFDCRADGIPSSQVTWIMPGNIFLPAPYFGSRVTVRPNGTLEMRNIRLS 



DSADFTCVVRSEGGESVLVVQLEVLEMLRRPTFRNPFNEKVIAQAGKPVALNCSVDGNP 

PPEITWILPDGTQFAmPIiNSPYmAGNGSLlLYK^TRNKSGKYRCAARNKVGYIEKLIL 

LEIGQKPVILTYEPGMVKSVSGEPLSLHCVSDGIPKPNVKWTTPGGHVIDRPQVDGKYIL 

HENGTLVIKATTAHDQGNYICRAQNSVGQAVISVSVMVVAYPPRIINYLPRNMLRRTGE 

AMQLHCVALGIPKPKVTWETPPJiSLLSKATARKPIiRSEMLIffQGTLVIQNLQTSDSGVY 

KCRAQNLLGTDYATTYIQVL (SEQ ID NO: 2) 
Nacl



IrrrooAAc 



TCCAATGGAATACTAClCAACTTAra 

GNATGOTCTX^GGATCTTGAGTa^ 

ACTX^CTQATAAQTO<XrrATTTG^ 

TrnTOTGTGTATAATATTAAOA^ 

TGTTTCACnXKiATTACCOOAAGAAAATAATAACrrOA^ 



AOGTOACAAAQTATCXTK^CACATT OfoAAAAQACCAC^ 
<yUTTCATATTMCTCX^GTTCCXXATOT^ 
AACTTCAOCAQCTTCXnACACAQAQCAQTTACT 
CCArfAAOCIXX^CIT^AA^ 
TGTAAAOTATCnX^ACCAAAQAQCATOQTC^TGCT 
TAATTITATOOTAOAAATOTC^ACAAGCTrOTXXlAQGCTO 
QATOCXXTTTCAOTrT^ 
ATITAACXXCOCAAGTAGG<IATAATTTTAA^^ 
TCTTCTAAAAAQACTTlXKn-ATGCATACKU 
AAATAGGAQAAACTGAQCTAGAATAGGTATAAAATATTTI^^ 
ATAGGTTATATAAOAOGTOGGAA^ 
ACTA<KX3ATOCTXKK3AAAATACqTAClCT 
(XUCXTOACKncrGOAAOGCr^ 
GACXTTTAGTCTCK?AA<X3AGCTTTOQAAQ 
C^TOCCATATTTCCAAGQCAAOTQTOCAC^^ 
CACnXKTTAAQOACCTTGTTCGTQCCACAC^ 
TrCUCTXTrrAATrCTAOCTTGATX^ 
(UTATOQAATATTACTAATIT^^ 
ATTCTQTOGAGCCCAQTT^CGTGGOTITOAAT^ 
ATTACCTCUOATGAOrrC^^ 
CTGATOAAGTAAGC^TATAAAGTGAQAGCAGCATO 
TOCTATTATAATGTGOAAATAAACTOACTTTAAAAAAAAAAQCr^ 
GAOTAAATCHTCXATAAATATCATCriX^ 
ftAACCTITAGAAATTTTOTTAACACAAAG 
TGOTOO<3<XTOAQAOATGGA(yTITAAOAAC^ 
ATXTKKXHX^AAACArrTATCCCAA^ 
ACAOQQATTAAACAGTCATTTTAGTCACATTTCTX^ 
CAQGATOTATCAGGATAACATCX^COT 
MCAA^CGA^C^CXX^TAAAAAmTCAOQ^ 
CLVAAAAGTATTATAT<nTCIT^^ 
ATAGACAAAQATQOTTTATGCAAOTGGGGATOGCQ^ 
CAMQGAACVUCTA(>OAGTOAA^ 
TrTAATAATTCTATrrMQATACA^^ 
ATAaAQAQAATAM(TOATAQATAanTIXmTrATCT^ 
TGTACACCTCAAACATAQATAACCAACAAAOAQ^ 
OQAAQAOTACATTCQAAAGTCAGACACCATKrr^ 
QAAAOTAOCCAAOGAQCTAAA(KK^TCTQCAAC^ 
AACCAGTACCCCOQAQCiriTGACK^ 
CCATCACTXX3AAAGAOAOGaXArnmCTTXK^ 
ATACCA(KK}<XAAAAAC*30<X^^ 
GQACTrTTXXn-ATAGGATTCKiAAATCnAAAT^ 

A NacIS17 

AAAAAAAAAAAAAAAAAAAAAGOAAGGTCAGACACXrrCACT^ 
GCAAACAGAAGGGGAGTCACAAACCXZAGGACAAAOCACAG 
TGTTATTGCTGTTGTTGTTA(XACX^TCATTAGCA 
CTATGACATGTTTTTTTATTCAAGTATATCACAT^ 



# 
CACACACACACACACACACACACAO^
(^CXXMQCITAAlXXnTTCCnt^^ 

QCGAQOCCCKXXX^CAaACKX^ 

GQAOGAGGAAAClXXKAGOOGAGOCTXyKnTgO 

jCXXATKXX^GAAOCmVQQ^ 

ACCOCTAOA<XXnXSA(X*OAC0(X^ 

QAQCG<XnxXrrMT(XXX^COQQAAGT0AA 

iE»»l(N«c^61) 

ACTCGAGGOGAGTJXjOOACAkAAGCA(XXlKCCA\ 

GGCAAAA<X^GCGGOAAGGATTATATGTAGOCl<3<XrAOGCTXTC 

TTC?lCCAAAGCAAjtX2AArrrGAGG^G 

A<nACrrCTAA<7ITrAGQTGA<^^ 

AOCATra^GCAGCAGCAGCA<K^WX^GCAATC^ 

ATGTATX5ATACCATTAAAATAAC^GAAC<XrOCIXXAGTr^ 

<nxiAAGACTAAG<xrrACTAA.G<X7rrnxxnxxxxrr^ 

TGAA<X5CTCTCCTCAGAAGATIT(XriOT 
AGAAGACCTCACOTA'nXKX^CyCr^ 

4 N«c4203 
TOTTAAATACAGTTTX1AQOCITAAGTG 
CTTCTCTTATAACTAACI^ 
CATCTTTOTTTATCTAATAATAGCAAGC^ 
CAAAAATATTCAGTGATOTAACAQTGACAGTGTAGO^ 
TTAATITOGGTTAAACTCATT^^ 

CACQTQCGGGTTAAAGATATTTTCTAACAAGAGAAGCAQA^ 

CCATCACTOTGTn^GT^^ 

TOCCTOCAMGTCAGAQACTCAGC^^ 

^QTTGQCTCTGAAAAGGCCCATGTOTCCT^ 

GAAATTTQAOCTTTTTrTCTA(7lX3GGTTA^ 

CCAAATAAATAAAACACATATTAAATAAT^^ 

GTACATTCATAAATGCAATGTOAAAAATATATATAATTTTTA 

TTITCTAAAITOCACATCnACT^^ 

AGTGTAAAAGGCTCTTGGTCAAACAAQAGAQTTAAA 

AAGATTATTATQATCICTAATGAC^GGGATCCTGCTT^ 

AAQATATGTTTACAATAATAAGCCCATTrAGT<MXAAAGTCX^AA 

ACTGQTTTATAAAGAGAACACnTKXnXjAQT^ 

AorrocTCTrrcrccAGATTr^ 

CATrATAAQFIXKiAATAAGATAQCUTCTC^ 

CTTTCAOOTCrrnxnTlX^GTTCACT 

TAATTXTmOGAQncrrAtm^ATIXX^ 

ACCTX^AAACTAGAACrACCAATOCATrACTOO^ 

GAATITATTGCTAC^TTOTTCATTAT^ 

OCOGTCAAAATATCAAOGOAT AAAGAAAATQTGGAAATXnxnAGAA 

QCCATAAAAQAGTGAAATCATQACATATGCACXKMTQQA^^ 

GGCTAATCAAAACAAQAC^GACTCAAAAAC^ 

QAAGCTAGATTTA(^CrrOTAarrGOOCAT^ 

ATTCTAATCItrTGAGTOTCrrATA^ 

TQCKn-AGGAGAQAAAGCAATGAQAGaAOTTAATAAQAACGAAGCATA^ 

(XAGGATQAAATGCATrAATTlCTATCCT^ 

AACCA(XXnT>AAATCCCAQCACA^ 

CAAOAAQCCATrTGCAGTTGCTGCCrQCTQQ^ 

ACTQTGTATAACAACCAGTTGACAATACAAAGTTGGC^ 

THXlATTTTirnCTTACTGrnT^^ 

ATTTOTTTCTTTTOAGAGAGAGAAQOAACATQAAATrGGOT 
ATCnXWAAQAAQTTXXKKjAAAGAQAAAAArrQTATGGA^ 
AACAAACAAAAGGTTX>JTITGCCACAAAAA 
CTTAAAGAAAATATTCCCAATTATK^ 
CTTOAACTOQCCCTATA^^ A
GTCA<X7XXXKX^X^TCIXXX1^ 

Ta^TOCCAGAOX?<X^TCX^ 

QQA(xnrTXKxnx3ATcnx^ 

AAACACCTOTCMTTACTrrrAATAGTTTTC^ 

TGGAGACAATOAAC^rTTTATCXX^ 

GTTTXK30AGCXnX^TAATTATO<XK}CAQCT^ 

AC^QTTTCKjITAACAATCATrATX3ATA(XXAGGAA 

AAAAGCATTTXriTIT(XAATrrATAAATACXXA^ 

CCTTACTAAATOTGACAAATTATGGT^ 

ATAAATGCACAAGAAATGCK14TATX3CCATC^ 

ATOOCAMCACTArronCACOTTTXKU 

TATAACXOTATACCX^GTGCC^^ 

CTCXXTATGAOGGAGGGGCCTreOGCTOAGCACT^ 

(KK)AO<n7TTATX5ATAAClTOACXXAAOCrAQAQTCACTOQA 

GAGAACATGCCTGAAGAAGATCAGATTATAG^^ 

TOATTCATO<KKX*G<XKXX^^ 

AAAAAAAAACAGGCTGAGCAAGTCTC^GGAGCAAG^ 
TCTClXK^TCAGCKX^^ 

CAOTXiATCAA(>ATCATGTXK}AACTATAAGCCAAATA^ 

TTOGTCATXUTCmTCATCAGAGT^ 

CCTTTGGACTAAATTCTGTTATCTATAGCTO^ 

CTA(^CCAGAGTaTAATa<KK3CCTaA(K^ 

GGGGC1 "1 CI TAACCGGAACACAGTTGTAAAAGGG AATTTTCT^ 

ExMi3(NttcSOI9)l 

TCTOGACTTTAATTGACGATTCATC^^ 



AAAACXHXXXXJATTCCCGTGGGA 
TCAAGGAAGGATTTCAATTGTAATCfcAATTT^^ 
CTGGGGAAAATGCTTACACAArrGCGAGGACCTAATTTAGG^ 
AAAG<^CAGGATGG<XKX^GACACCTGCA^ 
CCC^GAGACTCATTCKKX&A^^ 
ACTTGACIX^GAAACTAATGTGGAAAGCATC^ 

CATGCATGAATAAGGGATXXX^GAGAGAA<KK3AAGAAAAAGGAAGGAAOGM 

AA<3GAAGGAaGGAAGGAAQGAACKjAAGGAAGGAAGAGAGGGA<^ 

AAGGAAGGAAAGGGAAAGGAAAAAAGAGATGGGOAGG<UGGOAAGGAAAGGAAA^ 

OGAflAAAGAACUGAAGAAACKUAAATAAAT^^ 

TTTTATCCATAAAAGGTCATTIXX^C^ 

AAGGGCAGTCnTAAGATAGTAGCATTTTATAAACXAT^ 

TTTTTA7XXTCTACCATCTTCAAACTOAAACT 

TGTAAGAGTTTIX^CACACCXSAAGTOGGAAAC^ 

AATX3K3GGAGAACAAAAAGAGACTGCACAGGGAGCTC 

TTGAGAAGCTAATTXUOGCroCATGACrACTCA^ 

ATGAAAAG<^CAATAAAGTACATATGT^^ 

ATAATAGCAATCAAAGCAATAA(^ 

ACTATGTCTAGTGTITIXjACITAGGGTACATAGTATGCTO 

ATTTAG<nX>TCAGACAGCATITGGAACATGTATCTT^ 

CKATGAAATTCTCCACCTATGTTTArrC^ 

GGGTTTGAGOTTTTACTCOCAGGTAACATT^ 

CTTnATTTATGAGGGATGTTCK)TAT^ 

GGTC AAAACAGAAAAATATGTAAAAAGTtATTGTTTTTATTAGTAT^ 

TTTTAAAAATXKTTATGCTTAGAACTAATTAAGATTAGATTAGAT^ 

AGAGAGGOATrrGATOAATXK^AAAG^TCATGAAAAA^ 

(^GAATCAATI AAATTCATATT A CTA1 AA AAGACAGCACGCX^AGATG TGTGCCAGCTG AG 
GAGTGGATAAACTGTGTAACGTGAGTGCTATGTAGAAACAGAAAGGAGTGAAGGGTTX3A ' 
TGTCKXXTTX3CAACATCTTG

ACATATTOCATOOTTATQTlTATATGAAATtJm 

OTAACHTOTTACTTAfiOOTCAGGAAAAQATTAAA 

ATCTTAAT1TOGGGAAAAOAGAATITCCT 

ATXXCKjITKjiATAT^ 

GCAGOAGGAAOTOAAATOAOATGOT^ 

GGTGa^ATCTAOATATCnTAATATATTaACATAAAOaAAa^ 

AAAA(XAACAQ<3AQTQAGG<jAGA^ 

TGTGTGTGGAGACACCATAATAAAA^ 

TAAAAACAAAACATTTTTGCACAAO^ 

GAAGTTGAAGTTCATTGATACnxnX^TAAATClTAAAT^ 

TTTAmnAATrACTcrracTar^ 

ATCKTITTCACCATACACAAATQTAOOCAGAAGAGTG^ 
CK5AOAGOK3AAAG<3AAGC^^ 
ATGAACAAGAAAAAATGTGGACACATG^ 
TATTCrrAATATTAAGAAAAT^^ 
TTATCTTOGCCACAGATCCTT^^ 
AATCCAATCCAQ<KK^TQATCKnTACTXX^ 
AAQTAACGATOACATCTCTCTOTTCCITr 
GTATGTTCTAATTlCrCTAA 
TGGAGTTAAG<7ITXCATAGG^^ 



TIXjTGAAAGTTOCTTTGAGAAA^ 

acctataacaaactocaaataattoagaaggatactito 

TIGCACCJGGATCACAACAACATTGAGTrrATCAA 

TCOXTIGGTACATCTACAACCA^ 

GAGCTATCTCCAGATATTTAAAACCTCCTTCATTAA 

A<XTXXCTCCCAAMGAAAT<^^ 
ACCXXTGGACCWIGACHtXX^m 



ACTATCTIXjTTTGTTTXjTTTCTTnT^ 

TATCCCAAAAQTCCCOCATAAOCTCCCCCC^ 

CCCTXKK^TTCCCCTGTACTG^ 

AQT(KTQGCCAACTAQO(XATCTTTT^ 

TACTGCnTAGTTGATAATCTTGTT^ 




(SEQ ID NO: 3)



Figure 8

Exon/Intron   Exon    Exon    Exon     Intron
No.           start   end     length   length
1             1       542     542      2356
2             2898    3094    197      1334
3             4428    4557    130      3068
4             7625    8019    394
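The coordinates in Figure 8 can be cross-checked with simple arithmetic. The following is a minimal sketch, not part of the original figure: the names EXONS, LISTED_INTRONS and check_table are hypothetical, and the values are copied from the table. It compares each listed exon length with the inclusive count (end - start + 1) and each listed intron length with the distance from one exon's end to the next exon's start. It only reports the numbers and does not decide which convention the table intends.

```python
# Values copied from Figure 8: (exon number, start, end, listed exon length).
EXONS = [
    (1, 1, 542, 542),
    (2, 2898, 3094, 197),
    (3, 4428, 4557, 130),
    (4, 7625, 8019, 394),
]

# Listed intron lengths, keyed by the exon that precedes each intron.
LISTED_INTRONS = {1: 2356, 2: 1334, 3: 3068}


def inclusive_length(start, end):
    """Number of positions from start to end, counting both endpoints."""
    return end - start + 1


def check_table():
    """Return (listed, computed) pairs for exon lengths and intron lengths."""
    exon_report = {
        number: (listed, inclusive_length(start, end))
        for number, start, end, listed in EXONS
    }
    gap_report = {}
    for (number, _, end, _), (_, next_start, _, _) in zip(EXONS, EXONS[1:]):
        # Assumption: compare against next start minus previous end.
        gap_report[number] = (LISTED_INTRONS[number], next_start - end)
    return exon_report, gap_report


if __name__ == "__main__":
    exons, gaps = check_table()
    for number, (listed, computed) in exons.items():
        print(f"exon {number}: listed {listed}, end - start + 1 = {computed}")
    for number, (listed, computed) in gaps.items():
        print(f"intron {number}: listed {listed}, next start - end = {computed}")
```

Under these assumptions the listed intron lengths equal the next exon start minus the previous exon end, and the listed lengths of exons 1 to 3 equal the inclusive count, while exon 4 is listed as 394 against an inclusive count of 395.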





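Figure 9

[Diagram: map of the genomic sequence. Labeled elements, left to right: putative promoter region, Exon 1, Exon 2 (1st ATG), Exon 3, Exon 4.]

Position markers:           1      3661  4203   6559    6755    8089    8218    11286   11680
Values in parentheses:      (-3661) (1)  (542)  (2898)  (3094)  (4428)  (4557)  (7625)  (8019)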
Figure 10



cDNA_rat       1 cgagagacgacagaaggttacggctgcgagaagacgacagaagggtccag
genomic_hu     1

cDNA_rat      51 aaaaaggaaagtgctggaggggagtggggacaaaagcagcgaccaagtga
genomic_hu     1

cDNA_rat     101 atgtcacttcagtgactgaggccaggcaaaacgcgcgggaaggattttgt
genomic_hu     1

cDNA_rat     151 gtagcttgggaccctttcatagacactgatgacacgtttacgcaaaatag
genomic_hu     1

cDNA_rat     201 aaatttgaggagaaacgcctgggccttcggaaaggagtgattgattagta
genomic_hu     1

cDNA_rat     251 cttgcaagtttaggtgactttaaggagaactaactaatgtatactattga
genomic_hu     1

cDNA_rat     301 gggaggaggaagagcattacagagtttccagcagcagcaggaaagctttg
genomic_hu     1

cDNA_rat     351 gttaatttggaaatggatgatagcattaaaataacagaagcgcctccagg
genomic_hu     1

cDNA_rat     401 tctctgaagcttcagtcccccagctgaaagccagaaaagactaagcccac
genomic_hu     1

cDNA_rat     451 taagccttttgatccctttggaagcaaagaactttccttccctggggtga
genomic_hu     1

cDNA_rat     501 agactctcctcagaagatttcctgtctctgcctatgttacaagaggaatc
genomic_hu     1

cDNA_rat     551 aaaaccaagacagaagagctcaggatgcaggtgagaggcagggaagtcag
genomic_hu     1

cDNA_rat     601 cggcttgttgatctccctcactgctgtctgcctggtggtcacccctggga
genomic_hu     1

cDNA_rat     651 gcagggcctgtcctcgccgctgtgcctgctatgtgcccacagaggtgcac
genomic_hu     1

cDNA_rat     701 tgtacatttcggtacctgacctccatcccagatggcatcccggccaatgt
genomic_hu     1

cDNA_rat     751 ggaacgaataaatttaggatataacagccttactagattgacagaaaacg
genomic_hu     1

cDNA_rat     801 actttgatggcctgagcaaactggagttactcatgctgcacagtaatggc
genomic_hu     1

cDNA_rat     851 attcacagagtcagtgacaagaccttctcgggcttgcagtccttgcaggt
genomic_hu     1

cDNA_rat     901 cttaaaaatgagctataacaaagtccaaatcattcggaaggatactttct
genomic_hu     1



cDNA_rat     951 acggactcgggagcttggtccggttgcacctggatcacaacaacattgaa
genomic_hu     1

cDNA_rat    1001 ttcatcaaccctgaggccttttatggacttacctcgctccgcttggtaca
genomic_hu     1

cDNA_rat    1051 tttagaaggaaaccggctcacaaagctccatccagacacatttgtctcat
genomic_hu     1

cDNA_rat    1101 taagctatctccagatatttaaaacctctttcattaagtacctgttcttg
genomic_hu     1

cDNA_rat    1151 tctgataacttcctgacctccctcccaaaagaaatggtctcctacatgcc
genomic_hu     1

cDNA_rat    1201 aaacctagaaagcctgtatttgcatggaaacccatggacctgtgactgcc
genomic_hu     1

cDNA_rat    1251 atttaaagtggttgtctgagtggatgcagggaaacccagatataataaaa
genomic_hu     1

cDNA_rat    1301 tgcaagaaagacagaagctcttccagtcctcagcaatgtcccctttgcat
genomic_hu     1

cDNA_rat    1351 gaaccccaggatctctaaaggcagaccctttgctatggtaccatctggag
genomic_hu     1

cDNA_rat    1401 ctttcctatgtacaaagccaaccattgatccatcactgaagtcaaagagc
genomic_hu     1

cDNA_rat    1451 ctggttactcaggaggacaatggatctgcctccacctcacctcaagattt
genomic_hu     1

cDNA_rat    1501 catagaaccctttggctccttgtctttgaacatgacananntntctggaa
genomic_hu     1

cDNA_rat    1551 ataaggccgacatggtctgtagtatccaaaagccatcaaggacatcacca
genomic_hu     1

cDNA_rat    1601 actgcattcactgaagaaaatgactacatcatgctaaatgcgtcattttc
genomic_hu     1

cDNA_rat    1651 cacaaatcttgtgtgcagtgtagattataatcacatccagccagtgtggc
genomic_hu     1

cDNA_rat    1701 aacttctggctttatacagtgactctcctctgatactagaaaggaagccc
genomic_hu     1

cDNA_rat    1751 cagcttaccgagactccttcactgtcttctagatataaacaggtggctct
genomic_hu     1

cDNA_rat    1801 taggcctgaagacatttttaccagcatagaggctgatgtcagagcagacc
genomic_hu     1

cDNA_rat    1851 ctttttggttccaacaagaaaaaattgtcttgcagctgaacagaactgcc
genomic_hu     1

cDNA_rat    1901 accacacttagcacattacagatccagttttccactgatgctcaaatcgc
genomic_hu     1

cDNA_rat    1951 tttaccaagggcggagatgagagcggagagactcaaatggaccatgatcc
genomic_hu     1






cDNA_rat    2001 tgatgatgaacaatcccaaactggaacgcactgtcctggttggcggcact
genomic_hu

cDNA_rat    2051 attgccctgagctgtccaggcaaaggcgacccttcacctcacttggaatg
genomic_hu

cDNA_rat    2101 gcttctagctgatgggagtaaagtgagagccccttacgttagcgaggatg
genomic_hu

cDNA_rat    2151 ggcgaatcctaatagacaaaaatgggaagttggaactgcagatggctgac
genomic_hu

cDNA_rat    2201 agctttgatgcaggtctttaccactgcataagcaccaatgatgcagatgc
genomic_hu

cDNA_rat    2251 ggatgttctcacatacaggataactgtggtagagccctatggagaaagca
genomic_hu

cDNA_rat    2301 cacatgacagtggagtccagcacacagtggttacgggtgagacgctcgac
genomic_hu

cDNA_rat    2351 cttccatgcctttccacgggtgttccagatgcttctattagctggattct
genomic_hu

cDNA_rat    2401 tccagggaacactgtgttctctcagccatcaagagacaggcaaattctta
genomic_hu

cDNA_rat    2451 acaatgggaccttaagaatattacaggttacgccaaaagatcaaggtcat
genomic_hu

cDNA_rat    2501 taccaatgtgtggctgccaacccatcaggggccgacttttccagttttaa
genomic_hu

cDNA_rat    2551 agtttcagttcaaaagaaaggccaaaggatggttgagcatgacagggagg
genomic_hu

cDNA_rat    2601 caggtggatctggacttggagaacccaactccagtgtttcccttaagcag
genomic_hu

cDNA_rat    2651 ccagcatctttgaaactctctgcatcagctttgacagggtcagaggctgg
genomic_hu

cDNA_rat    2701 aaaacaagtctccggtgtacataggaagaacaaacatagagacttaatac
genomic_hu

cDNA_rat    2751 atcggcggcgtggggattccacgctccggcgattcagggagcataggagg
genomic_hu

cDNA_rat    2801 cagctccctctctctgctcggagaattgacccgcaacgctgggcagcact
genomic_hu

cDNA_rat    2851 tctagaaaaagccaaaaagaattctgtgccaaaaaagcaagaaaatacca
genomic_hu

cDNA_rat    2901 cagtaaagccagtgccactggctgttcccctcgtggaactcactgacgag
genomic_hu

cDNA_rat    2951 gaaaaggatgcctctggcatgattcctccagatgaagaattcatggttct
genomic_hu



cDNA_rat    3001 gaaaactaaggcttctggtgtcccaggaaggtcaccaactgctgactctg
genomic_hu

cDNA_rat    3051 gaccagtaaatcatggttttatgacgagtatagcttctggcacagaagtc
genomic_hu

cDNA_rat    3101 tcaactgtgaatccacaaacactacaatctgagcaccttcctgatttcaa
genomic_hu

cDNA_rat    3151 attatttagtgtaacaaacggtacagctgtgacaaagagtatgaacccat
genomic_hu

cDNA_rat    3201 ccatagcaagcaaaatagaagatacaaccaaccaaaacccaatcattatc
genomic_hu

cDNA_rat    3251 tttccatcagtagctgaaattcgagattctgctcaggcaggaagagcatc
genomic_hu

cDNA_rat    3301 ttcccaaagtgcacaccctgtaacagggggaaacatggctacctatggcc
genomic_hu

cDNA_rat    3351 ataccaacacatatagtagctttaccagcaaagccagtacagtcttgcag
genomic_hu

cDNA_rat    3401 ccaataaatccaacagaaagttatggacctcagatacctattacaggagt
genomic_hu

cDNA_rat    3451 cagcagacctagcagtagtgacatctcttctcacactactgcagacccta
genomic_hu

cDNA_rat    3501 gcttctccagtcacccttcaggttcacacaccactgcctcgtctttattt
genomic_hu

cDNA_rat    3551 cacattcctagaaacaacaatacaggtaacttccccttgtccaggcactt
genomic_hu

cDNA_rat    3601 gggaagagagaggacaatttggagcagagggagagttaaaaacccacata
genomic_hu

cDNA_rat    3651 gaaccccagttctccgacggcatagacacaggactgtgaggccagcaatc
genomic_hu

cDNA_rat    3701 aagggacctgctaacaaaaatgtgagccaagttccagccacagagtaccc
genomic_hu

cDNA_rat    3751 tgggatgtgccacacatgtccttccgcagaggggctcacagtggctactg
genomic_hu

cDNA_rat    3801 cagcactgtcagttccaagttcatcccacagtgccctccccaaaactaat
genomic_hu

cDNA_rat    3851 aatgttggggtcatagcagaagagtctaccactgtggtcaagaaaccact
genomic_hu

cDNA_rat    3901 gttactatttaaggacaaacaaaatgtagatattgagataataacaacca
genomic_hu

cDNA_rat    3951 ctacaaaatattccggaggggaaagtaaccacgtgattcctacggaagca
genomic_hu

cDNA_rat    4001 agcatgacttctgctccaacatctgtatccctggggaaatctcctgtaga
genomic_hu

cDNA_rat    4051 caatagtggtcacctgagcatgcctgggaccatccaaactgggaaagatt
genomic_hu

cDNA_rat    4101 cagtggaaacaacaccacttcccagccccctcagcacaccctcaatacca
genomic_hu
cDNA_rat    4151 acaagcacaaaattctcaaagaggaaaactcccttgcaccagatctttgt
genomic_hu

cDNA_rat    4201 aaataaccagaagaaggaggggatgttaaagaatccatatcaattcggtt
genomic_hu

cDNA_rat    4251 tacaaaagaacccagccgcaaagcttcccaaaatagctcctcttttaccc
genomic_hu

cDNA_rat    4301 acaggtcagagttccccctcagattctacaactctcttgacaagtccgcc
genomic_hu

cDNA_rat    4351 accagctctgtctacaacaatggctgccactcagaacaagggcactgaag
genomic_hu

cDNA_rat    4401 tagtatcaggtgccagaagtctctcagcagggaagaagcagcccttcacc
genomic_hu

cDNA_rat    4451 aactcctctccagtgcttcctagcaccataagcaagagatctaatacatt
genomic_hu

cDNA_rat    4501 aaacttcttgtcaacggaaacccccacagtgacaagtcctactgctactg
genomic_hu

cDNA_rat    4551 catctgtcattatgtctgaaacccaacgaacaagatccaaagaagcaaaa
genomic_hu

cDNA_rat    4601 gaccaaataaaggggcctcggaagaacagaaacaacgcaaacaccacccc
genomic_hu

cDNA_rat    4651 caggcaggtttctggctatagtgcatactcagctctaacaacagctgata
genomic_hu

cDNA_rat    4701 cccccttggctttcagtcattccccacgacaagatgatggtggaaatgta
genomic_hu

cDNA_rat    4751 agtgcagttgcttatcactcaacaacctctcttctggccataactgaact
genomic_hu

cDNA_rat    4801 gtttgagaagtacacccagactttgggaaatacaacagctttggaaacaa
genomic_hu

cDNA_rat    4851 cgttgttgagcaaatcacaggagagtaccacagtgaaaagagcctcagac
genomic_hu

cDNA_rat    4901 acaccaccaccactcctcagcagtggggcgcccccagtgcccactccttc
genomic_hu

cDNA_rat    4951 cccacctccttttactaagggtgtggttacagacagcaaagtcacatcag
genomic_hu

cDNA_rat    5001 ctttccagatgacgtcaaatagagtggtcaccatatatgaatcttcaagg
genomic_hu

cDNA_rat    5051 cacaatacagatctgcagcaaccctcagcagaggctagccccaatcctga
genomic_hu

cDNA_rat    5101 gatcataactggaaccactgactctccctctaatctgtttccatccactt
genomic_hu

cDNA_rat    5151 ctgtgccagcactaagggtagataaaccacagaattctaaatggaagccc
genomic_hu

cDNA_rat    5201 tctccctggccagaacacaaatatcagctcaagtcatactccgaaaccat
genomic_hu
cDNA_rat    5251 tgagaagggcaaaaggccagcagtaagcatgtccccccacctcagccttc
genomic_hu

cDNA_rat    5301 cagaggccagcactcatgcctcacactggaatacacagaagcatgcagaa
genomic_hu

cDNA_rat    5351 aagagtgtttttgataagaaacctggtcaaaacccaacttccaaacatct
genomic_hu

cDNA_rat    5401 gccttacgtctctctacctaagactctattgaaaaagccaagaataattg
genomic_hu

cDNA_rat    5451 gaggaaaggctgcaagctttacagttccagctaattcagacgtttttctt
genomic_hu

cDNA_rat    5501 ccttgtgaggctgttggagacccactgcccatcatccactggaccagagt
genomic_hu

cDNA_rat    5551 ttcatcagganttgaaatatcccaagggacacagaaaagccggttccacg
genomic_hu

cDNA_rat    5601 tgcttcccaatggcaccttgtccatccagagggtcagtattcaggaccgt
genomic_hu

cDNA_rat    5651 ggacagtacctgtgctctgcatttaatccactgggcgtagaccattttca
genomic_hu

cDNA_rat    5701 tgtctctttgtctgtggttttttacccggcaaggattttggacagacatg
genomic_hu

cDNA_rat    5751 tcaaggagatcacagttcactttggaagtactgtggaactaaagtgcaga
genomic_hu

cDNA_rat    5801 gtggagggtatgccgaggcctacggtttcctggatacttgcaaaccaaac
genomic_hu

cDNA_rat    5851 ggtggtctcagaaacggccaagggaagcagaaaggtctgggtaacacctg
genomic_hu

cDNA_rat    5901 atggaacattgatcatctataatctgagtctttatgatcgtggtttttac
genomic_hu

cDNA_rat    5951 aagtgtgtggccagcaacccatctggccaggattcactgttggttaagat
genomic_hu

cDNA_rat    6001 acaagtcatcacagctccccctgtcattatagagcaaaagaggcaagcca
genomic_hu

cDNA_rat    6051 tcgttggggttttaggtggaagtttgaaactgccctgcactgcaaaagga
genomic_hu

cDNA_rat    6101 actccccagcctagtgttcactgggtcctttatgatgggactgaactaaa
genomic_hu

cDNA_rat    6151 accattgcagttgactcattccagatttttcttgtatccaaatggaactc
genomic_hu

cDNA_rat    6201 tgtatataagaagcatcgctccttcagtgaggggcacttatgagtgcatt
genomic_hu

cDNA_rat    6251 gccaccagctcctcaggctcagagagaagggtagtgattcttactgtgga
genomic_hu
cDNA_rat    6301 agagggagagacaatccccaggatagaaactgcctctcagaaatggactg
genomic_hu     1

cDNA_rat    6351 aggtgaatttgggtgagaaattactactgaactgctcagctactggggat
genomic_hu     1

cDNA_rat    6401 ccaaagcctagaataatctggaggctgccatccaaggctgtcatagacca
genomic_hu     1

cDNA_rat    6451 gtggcacagaaTGGGCAGCCGAATCCACGTCTACCCAAATGGATCCTTGg
genomic_hu     1            TGGGCAGCTGGATCCACGTCTACCCTAATGGATCCCTGt

cDNA_rat    6501 tggttgggTCAGTGACGGAAAAAGACGCTGGTGACTACTTATGTGTGGCA
genomic_hu    40 ttattggaTCAGTAACAGAAAAAGACAGTGGTGTCTACTTGTGTGTGGCA

cDNA_rat    6551 AGAAACAAAATGGGAGATGACCTAGTCCTGATGCATGtccgcctgAGATT
genomic_hu    90 AGAAACAAAATGGGGGATGATCTGATACTGATGCATGTtagcctaAGACT

cDNA_rat    6601 GACACCTGCCAAAATTGAACAGAAGCAGTATTTTAagAAGCAAGTGCTCC
genomic_hu   140 GAAACCTGCCAAAATTGACCACAAGCAGTATTTTAgaAAGCAAGTGCTCC

cDNA_rat    6651 ATGGGAAAGATTTCCAAGTTGACTGCAaggcctctGGCTCCCCTGTGCCT
genomic_hu   190 ATGGGAAAGATTTCCAAGTAGATTGCAaagcttccGGCTCCCCAGTGCCA

cDNA_rat    6701 GAGGTATCCTGGAGTTTGCCTGATGggacagtgcTCAACAATGTAGCCCA
genomic_hu   240 GAGATATCTTGGAGTTTGCCTGATGgaaccatgaTCAACAATGCAATGCA

cDNA_rat    6751 AGCTGATGACAGTGGCTATAGGACcaagaggtacACCCTTTTCCACAATG
genomic_hu   290 AGCCGATGACAGTGGCCACAGGACtaggagatatACCCTTTTCAACAATG

cDNA_rat    6801 GAACCTTGTATTTCAACAACGTTGggatggcaGAGGAAGGAGATTATATC
genomic_hu   340 GAACTTTATACTTCAACAAAGTTGgggtagcgGAGGAAGGAGATTATACT

cDNA_rat    6851 TGCTCTGCCCAGAACACCTTAGGGAAAGATGAGATGAAAGTCCACCTAAC
genomic_hu   390 TGCTATGCGGAGAACACCCTAGGGAAAGATGAAATGAAGGTCCACTTAAC

cDNA_rat    6901 AGTTCTAACAGCcatcCCACGGATAAGGCAAAGCTACAAGACCACCATGA
genomic_hu   440 AGTTATAACAGCtgctCCCCGGATAAGGCAGAGTAACAAAACCAACAAGA

cDNA_rat    6951 GGCTCAggGCTGGAGAAACAGCTGTCCTTGACTGCGAGGTCACTGGGGaa
genomic_hu   490 GAATCAaaGCTGGAGACACAGCTGTCCTTGACTGTGAGGTCACTGGGGat

cDNA_rat    7001 ccgaagcccaatgTATTTTGGTTGCTGCCTTCCAAcaatgtcATTTCATT
genomic_hu   540 cccaaaccaaaaatATTTTGGTTGCTGCCTTCCAAtgacatgATTTCCTT

cDNA_rat    7051 CTCCAATGACAGGTTCACATTTCATGCCAATagaaCTTTGTCCATCCATA
genomic_hu   590 CTCCATTGATAGGTACACATTTCATGCCAATgggtCTTTGACCATCAACA

cDNA_rat    7101 AAGTGAAACCACTTGACTCTGGGGActatgtgtgcgtagctcagAATCCT
genomic_hu   640 AAGTGAAACTGCTCGATTCTGGAGAgtacgtatgtgtagcccgaAATCCC

cDNA_rat    7151 AGTGGGGATGACACTAAGACATACAAACTGGAcattGTCTCTAAACCTCC
genomic_hu   690 AGTGGGGATGACACCAAAATGTACAAACTGGAtgtgGTCTCTAAACCTCC

cDNA_rat    7201 ATTAATCAATGGCCTGTATGCAAACAagACTGTTATTAAAGCCACAGCca
genomic_hu   740 ATTAATCAATGGTCTGTATACAAACAgaACTGTTATTAAAGCCACAGCtg

cDNA_rat    7251 ttcggcacTCCAAAAAATACTTTGACTGCAGAGCAGATGGGATCCCATct
genomic_hu   790 tgagacatTCCAAAAAACACTTTGACTGCAGAGCTGAAGGGACACCATct

cDNA_rat    7301 tcccaggtcacGTGGATTATGCCAGGCAATATTTTCCTCCCAGCTCCATA
genomic_hu   840 cctgaagtcatGTGGATCATGCCAGACAATATTTTCCTCACAGCCCCATA
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CttTGG^GCAGAQTCACGOTCCATCCWVATGGAACCTTGGAgatgagga 
CtaTGGAAGCAGAATCACACTCCATAAAAATGGAACCTTGGAaattagga 

acatccGGCTTTCTGACTGTGCGGACTTCACCTGTGTGGttcggagcgag 
atgtgaGGCTTTCAGATTCAGCCGACTTTATCTGTGTGGcccgaaatgae 

ggaGGAGAGAGTGTGTTGGTAGTG(^TTAGAAGTCCTAGAAATGCTGAG 
g gt GGAGAGAGCGT GT TGGTAGT ACAGTTAGAAGTAC T GGAAftTGCT GAG 

AAGACCAACATTCAGAAACCCATTCAACGAAAAagtcatcgcccaagctg 
AAGACCGACATTTAGAAATCCATTTAATGAAAAaatagttgcccagctgg 

gcaagcccgtAGCACTGAACTGCTCTGTGGATGGGAACCCCCCACCTGAA 
gaaagtccacAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAA 

a 1 1 acCTGGATCTTACCTGACGGCACACAGTTTGCTAACAGACCAcacaa 
ataatCTGGATTTTACCAAATGGCACACGATTfTCCAA'TGGACCAcaaag 

ttccccGTATCTGATGGGAGGCAATGGCTCTCTCATCCTTTACAAAgcaa 
1 1 atcaGTATCTGATAGCAAGCAATGGTTCTTTTATCATTTCTAAAaoaa 

Ot cggaac&agt CAGGGAAGTATCGCTGTGCAGCCAGGAATAAGGTTGGC 
ctcgggaggatgCAGGAAAATATCGCTGTGCAGCTAGGAATTAAGTTGGC 

TAcatcgagaaactcatcctgttagagATTGGGCAGAAGCCAGTCATTCT 
TAtattgagaaattagtcatattagaaATTGGCCAGAAGCCAGTTATTCT 

GACATACGAACCAGGGAtggtgaagagcgTCAGTGGGGAACCGTTATCAC 
TACCTAT GCACCAGGGAcagt aa aaggoaT CAGTGGAGAATCTCTATCAC 

TGCATTGTGTGTCTGATGGgATCCCCAAGCCAAATGTCAAGTGGACTACA 
TGCATTGTGTGTCTGATGGaATCCCTAAGCCAAATATCAAATGGACTATG 

CCGGGTGGccATGTAATCGACAGGCCTCAAGTGGATGGAAAATACATAcT 
CCAAGTGGttATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATAtT 

GCAT GAAAATGGCACGCI GGTCaTCAAAGCAACAACAGC t ca cga ceaAG 
GCATG ACAATGGCACCTTAGTCAT TAAAGAAGCAACAGCt ta t gacagAG 

GAAATTATATCTGTAGGGCTCAAAACAGTGTTGGCCAggcagttattagc 
GAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCAtacactgattact 

gtgtCAGTGATGGTTGTGGCCTACCCTCCCCGAATCATAAActacctACC 
gttcCAGTAATGATTGTAGCCTACCCTCCCCGAATTAGAAAtogtccACC 

CAGGAACATGCTCAGGAGGACAGGGGAAGCCatgCAGCTCCACTGTGTGG 
CAGGAGTATTGTCACCAGGACAGGGGCAGCCtttCAGCTCCACTGTGTGG 

CCTTGGGAATCCCCAAGCCAaAAGTCACCTGGGAGACGCCAAGAGACTCC 
CCTTGGGAGTTCCCAAGCCAgAMTC^CATGGGAGATGCCTGACCACTCC 

CT GCTCTCAAa a g caa cage aaga aa ac ccCATAGAAGTGAGATGCTTCA 
CTTCTCTCAAcggcaagtaaagagaggacaCATGGAAGTGAGCAGCTTCA 

CCCACAAGGT ACGCT gGTCAT TCAGAATCTCCAAACCT CGGATTCCGGa g 
CTTACAAGGTACCCTaGTCATTCAGAATCCCCATUVCCTCCGATTCTGGga 

tcTATAAGTGCAGAGCTCAGAACCTACTTGGgacTGATTACGCAACAACT 
taTACAAATGCACAGCAAAGAACCCACTTGGtagTGATTATGCAGCAACG 

TACATCCAGGTACTCTGACAGGAAgggggagactaaaattcaacagaagt 
TATATTCAAGTAATCTGACATGAAataataaagtcaacaacatctgggca 
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cDNA_rat 8401 Cca^twaGagggTTTATTTTTTGGAAGAAGTTTiy^TCAAAGGCAGCCA

genomic_hu 1940 gaa --TTTATTTTTTGQAAGAAGTTTJ^TCAMGGCAGCCA

cDNA_rat 8451 TAGGCATGTAAATGAgt cTGAATACATTTACAGTAT T AAATTTACAATGG

genomic_hu 1979 TAGGCATGTAAATGMttTGAATAGATTTACAGTATTAAATTTACAATGA

cDNA_rat 8501 ACATGCtfatga GACTTGTAAATGAAAGCATTGTGAACTGAaaccg

genomic_hu 2029 ACATGCaaaataaaagGACTTGTAAATAAATGGATTATGAACTGAtgata

cDNA_rat 8546 agtctctg T0<3ATCTCAAAGCAAACTCTTAACTTAAGGCACTTTg

genomic_hu 2079 otgatttatttaaTGGATCTCAAAACAAACTTTTAACTTAAGGCACTTTt

cDNA_rat 8591 MTTTGCCAACAAATAATAACAAACAttaa gagaaa aaaa tgat cCACTA

genomic_hu 2129 ATTTrGCCAACAAATAACAATAAACAaacattgaaacggtt CACTA

cDNA_rat 8641 CGAAATAACAAACGGCTAATGGACCTGAATTctcagtaaaaagacotttc

genomic_hu 2175 TAAMTAACAAATGGCTAATGTACCTOAATTtttcagt a a aa a aa tgaac

cDNA_rat 8691 tctcgctaac^gttgCCAGCTGCCTCGTOTCTGTTTCCTACCAATGTCAC

genomic_hu 2225 ttctaata CCAGTTG<XtAGTGTCCACCTCCTATCAATGTTAC

cDNA_rat 8741 AAACAt og caca cagggtGAATGWGTCaVACGGGAAAGATTAAOTTTGCG

genomic_hu 2268 AAGCAtggcactca GAAGAGAGACAATGGAAAATATTAAATCTGCA

cDNA_rat 8791 GTCtgtgtaaatctca ATGTACAAATATTGTGt cncTG

genomic_hu 2314 ATCtttatgatgtaaatttaccatcctgATGTATAAATATTTTG TG



exonl (2342-2397) 



cDNA_rat 8791 GTCtgtgtaaatctca - ATGTACAAAT ATTCTGt en cTG

genomic_hu 2314 ATCtttatgatgtaaatttaccatootgATGTATAAATATTTTG- TG



cDNA_rat 8829 GTTTATAAACATTTTGATAAAACCGAAAAAAAAAAAAAaaaaaaaaaaaa

genomic_hu 2360 GTTTATAAATTTTTTTGCTAAAACX^TACAjGAAAAT AAGca ctgaa ct gt c



(Genomic human OCP: SEQ ID NO:4) 
(cDNA rat: SEQ ID NO:5) 




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Figure 11
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8 
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9 


15358 
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1039 




10 


16864 


-17760 


-897 


? 


Last exon might be complete 


11 


?(1) 


2317 


-231 
7 


25 


Exon is not complete and start site is not known


12 


2342 
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56 
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FIGURE 12 



608 Human translated nucleotide sequence (°^) 

ATGAAGGTAAAAOGCAGAGGAATCACCTGCTOCT^ 

GATCTGCCTGGTCGCCACCCCTGGGGGCAAGGCCT 

cctgttatatgcctacggaggtacactgcacatttcggtacctgacttcca 
tcccagacagcatcccgcccaatgtggaaccx:atcaatttaggatacaac 

AGCTTOGTTAGATTGATGGAAACAGATTTTTCTGGCCTGACCAAACTGG/l 

GTTAC'rcATGClTCACAGCAATGGCATTCACACAATCCCTCACAAGAC^ 

CTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTATAATAAAGTCC 

G^CTTCAGAAAGATACT™^ 

CACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGG 
GCTCAACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCT 
CCACCCAGATACATTTGTCTCTrrGAGCTACCTCCAGATATTTAAAATCTC 
T^CAT^AGTTCCTATACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAA 
GAGATGGTCTCCTATATGCCTGACCTAGACAGCCmACCTGCATGGAAA 
CCCATGGACCTGTGATTGCCATTTAAAGTGGTTGTCTGACTGGATACAGCC 
AGATGTAATAAAATGCAAAAAAGATAGAAGTCCCTCTAGTGCTCAGC^ 
GTCCACTTTGCATGAACCCTAGGACTTCTAAAGGCAAGCCGTTAGCTATG 
GTCTCAGCTGCA(iCTTTCCAGTGTGCCAAGCCAACCATTGACTCATCCCTG 
AAATCAlAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTITCATCTCT 

ccccaaggtitcatggcaccctttggctccctcactttgaatatgacagat 

cagtctggaaatgaagctaacatggtctgcagtattcaaaagccctcaag 

gacatcacccattgcattcactgaagaaaatgactacatcgtgctaaata 

cttcattttcaacatttttggtgtgcaacatagattacggtcacattcagc 

cagtgtggcaaattttggctttgtacagtgattctcctctgatactagaaa 

ggagccacttgcttagtgaaacaccgcagctctattacaaatataaacag 

gtggctcctaagcctgaagacatttttaccaacatagaggcagatctcag 

agcagatccctcttggttaatgcaagaccaaatttccttgcagctgaaca 

gaactgccaccacattcagtacattacagatccagtactccagtgatgct 

caaatcactttaccaagagcagagatgaggccagtgaaacacaaatgga 

ctatgatttcaagggataacaatactaagctggaacatactgtcttggta 

ggtggaaccgttggcctgaactgcccaggccaaggagaccccaccccaca 

cgtggattggcttctagctgatggaagtaaagtgagagccccttatgtca 

gtgaggatggacggatcctaatagacaaaagtggaaaattggaactcca 

GATCiGCTGATAG'rnTGACACAGGCGTATATCACTGTATAAGCAGCAATT 
ATGATGATGCAGATATrCTCACCTATAGGATAACTGTGGTAGAACCT^GG 
TCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 
1 ^ . T-r-m a rrrr ArTr-.r.T ATCr.CAGATGCCTCTATTAGC 
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TATGGTCCCGGCCACTAAAGCTTTGAACCTTCCAGCAAGGACAGTGACTG 
CTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATAAATTATGGC 
ACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAACQC 
ACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAA 
TATAAACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATT 
CATCCACTGTCTTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTG 
ACCAGATGGGAAGAGGAAGAGAGCATTTCCAAAGTAGACCCCCAATAAC 
AGTAAGGACTATGATCAAAGATGTCAATGTCAAAATGCTTAGTAGCACCA 
CCAACAAACTATfATTAGAGTCAGTAAATACCACAAATAGTCATCAGACA 
TCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTCTATTCTCACAC 
TACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACAGCTGC 
TCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTT 
CAGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATC
AGCCCATATAGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAG
GTCAACAACCAGAGGTTCTTCTGAAAAAAGCACTACTGCATTCTCAGCCA

CAGTGCTCAATGTGACATGTCTGTCCTGTCTTCCCAGGGAGAGGCTCACCA 
CTGCCACAGCAGCATTGTCTTTTCCAAGTGCTGCTCCCATCACCTTCCCCA 
AAGCTGACATTGCTAGAGTCCCATCAGAAGAGTQTACAACTCTAGTCCAG 
AATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAAACAACACC

CACAATAAAATA1TTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGTG 
CAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAA 
GTAAACGCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGA
TTCAGTGATTACATCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGAC

TATTATAGCCATTACAAGGTTTTCAAGAAGGAAAATTCCCTGGCAACAGA 
ACTTTGTAAATAACCATAACCCAAAAGGCAGATTAAGGAATCAACATAAA 
GTTAGTTTACAAAAAAGCACAGCTGTGATGCTTCCTAAAACATCTCCTGCT 
TTACCACAGAGACAAAGTTCCCCTTTCCATTTCACCACACTTTCAACAAGT
GTGATGCAAATTCCATCTAATACCTTGACTACCGCTCACCACACTACGACC
AAAACACACAATCCTGGAAGTCTTCCAACAAAGAAGGAGCTTCCCTTCCC
ACCCCTTAACCCTATGCTTCCTAGTATTATAAGCAAAGACTCAAGTACAA
AAAGCATCATATCAACGCAAACAGCAATACCAGCAACAACTCCTACCTTC

CCTGCATCTGTCATCACTTATGAAACCCAAACAGAGAGATCTAGAGCACA 
AACAATACAAAGAGAACAGGAGCCTCAAAAGAAGAACAGGACTGACCCA 
AACATCTCTCCAGACCAGAGTTCTGGCTTCACTACACCCACTGCTATGACA 
CCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAAAACACAACTGGGATT 
TCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACAGATGTGATT 
GAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCTTC 
TGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAG 
CATCATTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAA 
TGCCAGTTCCCATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACAC 
GTGGCGACTCCCATTTCCGGCTTATGACAAATACAGTGGTCAAGCTGCAC 
GAATCCTCAAGGCACAATCTCCAAATGCCAAGTTCACAATTGGAACCACT 
CACTTCATCTACCTCTAATCTGTTACATTCTACTCCCATGCCAGCACTAAC 
AACAGTTAAATCACAGAATTCCAAATTAACTCCATCTCCCTGGGCAGAAT 
ACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGCAAAAAG 
CCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCT 
TGTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATA 
AGAAACCAGTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTT 
TGTCTAGGTATATAT1TGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCA 
AGTTTTACTATTCCAGCTAACTCAGATGCCTTTC^TCCCTGTGAAGCTGTT



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GGAAATCCCCTGCCCACCATTCATTGGACCAGAGTTTCAGGACTTGATTTA 

TCTAGAGGAAACCAGAATAGCAGGGTCCAGGTTCTCCCCAATGGTACCCT 

GTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTACTTGTGTTCCG 

CATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTGGTTT 

CCTATCCTCCCAGGATCCTGGAGAGACGTACCAAAGAGATCACAGTTCAT 

TCCGGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCC 

CTACAGTTACCTGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCC 

AGGGAAGTAGGCAGGCTGTGGTGACGGTTGACGGAACATTGGTCCTCCAC 

AATCTCAGTATTTATGACCGTGGCTTTTACAAATGTGTGGCCAGCAACCCA 

GGTGGCCAGGATTCACTGCTGGTTAAAATACAAGTCATTGCAGCACCACC 

TGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGCACTTGGGGTGAAA 

GTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGCGTTTACT 

GGGTCCTCTCTGATCKjCACTGAAGTGAAACCATTACAGTTTACCAATTCCA 

AGTTGTTCTTATTTTCAAATGGGACTTTGTATATAAGAAACCTAGCCTCTT 

cagacaggggcacttatgaatgcattgctaccagttccactggttcggag 

cgaagagtagtaatgcttacaatggaagagcgagtgaccagccccagga 

tagaagctck:atcccagaaaaggactgaagtgaattttggggacaaatta 

ctactgaactgctcagccactggggagcccaaaccccaaataatgtggag 

gttaccatccaaggctgtggtcgaccagtggagctggatccacgtctacc 

ctaatggatccctgtttattggatcagtaacagaaaaagacagtggtgtc 

tacttgtgtgtggcaagaaacaaaatgggggatgatctgatactgatgca 

tgttagcctaagactgaaacctgccaaaattgaccacaagcagtatttta 

gaaagcaagtgctccatgggaaagatttccaagtagattgcaaagcttcc 

ggctccccagtgccagagatatcttggagtttgcctgatggaaccatgat 

CAACAATGCAATGCAAGCCGATGACAGTGGCCACAGGACTAGGAGATAT 

ACCCTTTTCAACAATGGAACTTTATACTTCAACAAAGTTGGGGTAGCGGA 

GGAAGGAGATTATACTTGCTATGCCCAGAACACCCTAGGGAAAGATGAA 

ATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATAAGGCAGAG 

TAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGACT 

GTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCC 

AATGACATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGG 

TCTTTGACCATCAACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATG 

TGTAGCCCGAAATCCCAGTGGGGATGACACCAAAATGTACAAACTGGATG 

TGGTCTCTAAACCTCCATTAATCAATGGTCTGTATACAAACAGAACTGTTA 

rrAAAGCCACAGCTGTGAGACATTCCAAAAAACACTTTGACTGCAGAGCT 

GAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCAGACAATATTTT 

CCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAATGGAA 

CCTTGGAAATTACiGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTG 

TGGCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTA 

CTGGAAATGCTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAAT 

AGTTGCCCAGCTGGGAAAGTCCACAGCATTGAATTGCTCTGTTGATGGTA 

ACCCACCACCTGAAATAATCTGGATTTTACCAAATGGCACACGATTTTCCA 

ATGGACCACAAAGTTATCAGTATCTGATAGCA^GCAATGGTTCTTTTATCA 

TTTCTAAAACAACTCGGGAGGATGCAGGAAAATATCGCTGTGCAGCTAGG 

AATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATTGGCCAGAA 

GCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGAG 

AATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCA 

AATGGACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGG 

AAATACATATTGCATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGC 

TTATGACAGAGGAAACTATATCTGTAAGGCTCAAAATAGTGTTGGTCATA 
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CACTGATTACTG1TCCAGTAATGATTGTAGCCTACCCTCCCCGAATTACAA 

ATCGTCCACCCAGGAGTATTGTCACCAGGACAGGGGCAGCCTTTCAGCTC 

CACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACATGGGAGATGCC 

TGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGAAGTG 

AGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCC 

GATTCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTA 

TGCAGCAACGTATATTCAAGTAATCTGA 
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Figure 13 
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Figure 14 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]



GTGCTi 



GTGATCGATTAGTACTT(7I3^^ GAGAACTAA.CTAACCTATACTA 

^TCflfiGGA^AGGAAGAGCATT CCAGCAC^^ 



CCCOCftflCT AHjWIJMGCCTACtMQCCTTTTGCTC^ 

*** ** 4 ****** * * 

^OTGCmCTGGTCXCCW^ 
TCAGCTGCTTOCTGATCTCCCTCAC^^ 

*** * ***** ** ***** * ********** * ********** *** ** 

************ ******** * € ********* *** 

^CTTCCATCCCftGACAG^ 

**** ^********** * ****** ************* * ******** ** *** 
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human_5+3_corrected
mus cDNA 5



AGCCTCACTRGAX^^ 

*** * ******** **** ** *** ******** ** **************** 



human_5+3_corrected
mus cDNA 5



CTGCACAjCTAATG<XMTC^^ 

** ***** ************* * ** *************** * ****** ***** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



C&GOTCTTAAAAA1X3AGCT 
CAG<JTCTTAAAAATGfVK^A!EAACAA 

*********************** ******* ** ** *** ******** ** ** 



human_5+3_corrected



CTCGGGAGCTTGGTCC^ 
CTCAGGAGCXT<5^0C0GG1TOTVC^^ 

*** ******** ** ****** **** ******** ***** ** ** ***** *** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



GCGTTTTACGGftClXy^^ 

* ***** ** ** * ** ****** **** ** * ******** 4 *** ** *** 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



CTGCATCC5U3AJCAGAa^ 

CTCCAOCC*tf3ATACATTTG^ 

CTCCATCCRGAQU^TITOTCTCTrTGAG^ 

***** ***** *********** ** ***** **************** *** ****** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



AAGKACCTGTACITOTA3X3A3MCT 

*** * ***** ********** * ************ **** ********** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



C^KCAAACCTAjGAAAGCC^ 
TAMCCTGAOC^G^tSCCTTI?^^ 
TATGCCAAACCXACRAW3CCTTTACTTGCATC 
***** ******* ***** ** ************************* ******** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



MAGTGCTTCI^^CT^^ 

AAAiGTOGTTGTCCGAGTGcai^^ TTGTrTG 

************ ** ***** **** **** **** ** * * 



rat_cDNA

human_5+3_corrected
mus cDNA 5



AAGCTCTTCCROTCX^CAGC3^^ 

TTTCTTTITTTATARKAOGTATTTT(XTCA^ 

* * * * * *** * ** *** * * * **** * 



rat_cDNA

human_5+3_corrected
mus cDNA 5



GACTCTTTQCTAT<3CTACCATa^^ 
*VKXXjTTAGCXAT<5<n^^^ 

KXXXATAACCTCCCOCCCA CTTCOCTAOCTAOGCATTC — CCATTTTTTGGC 

**** ** ***** *** + ***** 



rat_cDNA

human_5+3_corrected
mus cDNA 5



CCClH5AAATCAAAGAlGCCT^ 

CCTGGCATTCCCC — 

***** 



rat_cDNA 

human_5+3_corrected

mus cDNA 5



AAGA3TTCMAGAACCCTTTGGCTO 

AAGGT rTCATO<K^CCCTTT (XXTTCXX^TCACTTTGAAXATGACAG AT CAGT CTGGAAAT G 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



AGC^XGACAlWrCTOTAG^ 
A^CTAACATGGTCTGCavSI^ 






rat_cDNA

human_5+3_corrected
mus cDNA 5



AAGAAAATGACTACA?CGT<Xra^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



JOTATAATC^OJCXAGCEAOT 



human_5+3_corrected

mus cDNA 5



XACTAGRAAGGW&OCACTTGCTTAGTGAAAl^^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



TC^CTCCTAAGCCTGAAGACATTm^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



^TTGGTTOC&ACAAGMA^ 
CTT<^5?AATGO^ftGACCfcAAir^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



CATTACAGATCCAGTTTTra^^ 
CAjmCAGATCCAC^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



^(jGAGAGACtCAAATQGACCATGATC^ 
CAGIt5AAACACAAAT<3<y^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



TCX^OTTGGCGGCRCTAOT 



rat_cDNA

human_5+3_corrected
mus cDNA 5



tGGAATGGCTTCTMCTG^^ 
TGGfcTTGGCnrKaAGC^ 



rat_cDNA

human_5+3_corrected



GG&TCCTAAIAGAC&AA^ 



rat_cDNA

human_5+3_corrected



CTCmACC*CTGCAia»^^ 
GCGTAlMCACTOTATAfttSCAflCAATtATGATG^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



OT5TGOTAGAGCCCTAT€GAGAAA^ 

TA1CA<5<?AAAATGGGAMCAT<^CR^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



CGSCTGAGAOGOTXACCT 



rat_cDNA

human_5+3_corrected



G<^TTCTTCCAG«^CA^ 
GGGTTATTCOUSGAAACAATG^^ 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]
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^^^^ 



TGGTT 
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rat_cDNA

human_5+3_corrected

mus cDNA 5



. CAGOCA QGAW3^)t^rCrTOO^ 

<»GGGAA/3AGGAM3U3A^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



AAAGATGOTCAAIGTCftftftNATCXJrTAOT — TtftG 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



AGCCAATAAATCCAACA/yAAGTTAX^^ 



rat_cDNA

human_5+3_corrected

mus cDNA 5



CXAGCAGOMT^CATCltW 
CCT^^C^TCRCTTCTAIlCrCA^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



CAGGI?CftCACACX2*CT^^ 



human_5+3_corrected
mus cDNA 5



ACMMXCGCT(OTCA<»0GCm<3^^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



AAAAC(X»CATAflAAOaX^ 
TCAGCCCATAT&GAACTCCAGTTCTG^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



T<3*AGGGAOTGCT2ACA^ 
CCAGAGGTTCTTCTGAAT^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



CCCAC3VGAI0TCCTTCCG^ 
OXCaXTKXMTCTTCOGfl^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



(HTCATCCCftGAGTGCCCTO 
OT<K7TGCTCOCArCAOCTTC00CAAA^^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



COWSTOTGCTCAAGM^^ 
CAACTCTAGTCCAGAATOCACTAirTACT 



rat_cDNA

human_5+3_corrected
mus cDNA 5



TAATAACAACCACTA^AT^TAn 
19tlACAACACCCAC3A3M^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



CAA<XAT<aCmrcCTCCAACAICT 
CAGTC^TGRCTVTATOClXXAACATCCAtA 



rat_cDNA 

human_5+3_corrected

mus cDNA 5



(TTCACX^GAGCAT(5CCTGGGA^ 
GTTACCCACGTGIKTTCTAG^^ 
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[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]
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human_5+3_corrected



TlX^^CTGrQGIACTW 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]



CMCTOWCCWTCR^^ 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]



CCKAGGCT(3rG<rrCGACCA0T^ GCWX^TGGATCCACGTC^COCTM^TG 



rat_cDNA

human_5+3_corrected



GATCCTTt^TGGTTGGGTCACn^ 
GATCCCTt^ATT<S(3ATCA<^ 



rat_cDNA 

human_5+3_corrected
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rat_cDNA

human_5+3_corrected



human_5+3_corrected
mus cDNA 5



ACTO«AGQCCrCS«WCKXX^ 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]



C^CO<»S<^^GgCKaACTft^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



CTCTCCTTGACT^^ 



[Alignment row labels: rat_cDNA, human_5+3_corrected, mus cDNA 5; sequence rows not legible]



TOTTCCTOCCAGCTC^^ 



human_5+3_corrected
mus cDNA 5



AMTTAGGAATGTGAGGCT1TCRQftTTCW?=CGAOT 
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rat cDNA GAGGM3A/»£TGT<^ 

mus cDNA 5 ---



human_5+3_corrected ITAGAAATCCATTTAAT^^

mus cDNA 5 ---



rat cDNA (XTCTGTGGA3N^SAACOCCCCAC^^
human_5+3_corrected GCTCTOTTOMGGTAAC^^

mus cDNA 5 ---



rat cDNA 3^TGCXAACRGACX3?VCACAATTOCXXCTA
human_5+3_corrected TTTCx^AT<xaccACAA^

mus cDNA 5 ---



rat_cDNA 

human_5+3_corrected

mus cDNA 5



CTAAAACAACTOGGGftlS<^TQCRGGAAAATAXC^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



ATATTGAGAAATTAOTCATATXAGA^^ 



rat_cDNA Cftlt5GGM?GGTGaZW^<3CGTGAI5TO<3GG^

human_5+3_corrected C^GOGACAGIMAAGGGATCAOT

mus cDNA 5 ---



rat_cDNA TCCXX3AAGCCftAATGTCMGT<3GACT
human_5+3_corrected Tax^rAAGCCAAMTVrC^

mus cDNA 5 ---



rat_cDNA TCQmSGAAAAlACAtACTG^
human_5+3_corrected TtftAT«3GRAASACMATTG^

mus cDNA 5 ---



rat_cDNA ACGACCftfcGGAMTTAI^^
human_5+3_corrected ATGACAGAGGAAACTA3^^

mus cDNA 5 ---



rat_cDNA 

human_5+3_corrected
mus cDNA 5



TGTCAGttMGGTTGTt^ 
TTCCAGTAATGATrcTAGCCTACCC^^ 



rat_cDNA 

human_5+3_corrected
mus cDNA 5



rat_CDNA A^TCACCTQGGAGACGCCWW^C^ 
human_5+3_corrected AAATCACATGGGAGA < IH5CCTGAjCC^ 



rat_cDNA ATftGAAGTGAGfcTtXTTCAC^^ 
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rat_cDNA 

human 5+3 corrected 
mus cDNA 5



ATTCTGGGATATACAAATGCACAGCMf^ 



human_5+3_corrected
mus cDNA 5



AmTTCKAGIAZCrCTGACMGAA ATAAIAAAGT-CAACAA CATCI5GGCA 



rat_cDNA
mus cDNA 5



GAATmTTTmO<5AW3AAGTT^^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



ATACATTTA<^<3TfcSTAAAT^ 
AraCATTTAC^ATTAAAXTTAC&AT 



rat_cDNA

human_5+3_corrected
mus cDNA 5



GCATTGTGAACTGAA ACCGAGTCTCTG — TGGWSTCrcRAACCA?UO€TTAACTTAA 

GGATTATCAACTGMtS^ 



rat_cDNA

human_5+3_corrected
mus cDNA 5



GGCACOTTC^TITW 

GGCACTTTTATTTTGCO^^ ACATTGAAACGGTTCftCTAT 



rat_cDNA

human_5+3_corrected
mus cDNA 5



GAAATftACAMCGGCTAATGCACC^^ 

AAAATMCRAATGGCTAA7<3TACCT TGAACTT-CXAAT 



rat_cDNA

human_5+3_corrected
mus cDNA 5



AGTrGCCAGCTGCCTCGT^^ 

A CCAGTTGCCTAGTCTCC^^ — A 



rat_cDNA

human_5+3_corrected
mus cDNA 5



AIG<2*STCAAa3G<rotf^ 

ACAGAGACftM^ AAftATATTAAATCT GC « AATCT--ATGTAIAAAtftTTT 



human_5+3_corrected
mus cDNA 5



rat_cDNA 

human_5+3_corrected
mus cDNA 5



-G(n^TAIAAATTTTTTT<X^AAA^ 



TGT- 



AAAA 



(rat cDNA: SEQ ID NO:7) 
(human_5+3 corrected: SEQ ID NO:8) 
(mus_cDNA_5: SEQ ID NO:9) 
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Figure 15 



[Alignment row labels: rat, human_5+3_corrected, mouse_5_corrected; sequence rows not legible]
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rat PL&VPLVELTDTORDaS<3MIProe^^ 
human_5+3_corrected PWTOLPNIP^DDSSOIIALHEBF^

mouse_5_corrected ---



rat EW-TVNPQTI^EHLI , DFKLFSVTN^ fiV

human_5+3_corrected EFSPVVNSQn.m^DFKLSIAlISEAMSKNIN^^

mouse_5_corrected ---

rat J\£T£DSAQAGIAS--SQ$AH^^

human_5+3_corrected TEFQD$DCM<^GI^F^PPITVRaMIfa^^

mouse_5_corrected ---



rat m<^^SSSDISSHXtAD1?SFSSHPSG8HTTAS^

human_5+3_corrected SVWEVSEPRHNHFySHTTOXI^STFPSDPl^^

mouse_5_corrected ---

rat IWSI^VKNPHRrmittHRHRrm

human_5+3_corrected IGGRGRl ISPYRTPVLRRKRYSI FR$XTK<^^KSTTAF3ATVLKVTCLSCLPRERLTTA

mouse_5_corrected ---



rat T«u£VPSSSHSAiiPKiTOrv<(?viJ^^ 

human_5+3_corrected £AM£FPSAAPITFPKM)I&RVPSEES^ 

mouse_5_corrected ---



rat NHVIgTEASMTSAPTSVSI/^PV^
human_5+3_corrected S0VTP5X»VMTIAPTSXfMEKTHKVN^

mouse_5_corrected ---



rat T$tKF£KRKTPIJ3QIFVlWQEOT<^

human_5+3_corrected AITRFSREXI PfJQQNJVNNHN PKO^XJ^HKVSLQKSTAVMLPKT SP ALPQRQSS PFHFT

mouse_5_corrected ---

rat mTSPSKu^SlTOAATCSNK)^

human_5+3_corrected TIiSTSVMQIPSNTXMAHim^

mouse_5_corrected ---

rat LSTETPT-WSPT&IASVIMSETQ^

human_5+3_corrected ISTCHMPftTTPTmevirYET^

mouse_5_corrected ---



rat SaLTTADTPLAFSHSERQDDGQ!^^ 
human_5+3_corrected TJ^P— PAIAFiraPPEim:GlS$riSFHSRTia^

mouse_5_corrected ---



rat LI*SK$<^STTVKRAS-DTP-^
human_5+3_corrected I£$KSHC£TTTRKASLOTmmssSATlMm

mouse_5_corrected ---



rat VTIYESSRHimH<gQPS££A$PNPEIITC^

human_5+3_corrected VKLHE3SRHN — LC£*PS$OLEP 8 £TSNLLRST PMPALTTVKSQN S KLT PS P

mouse_5_corrected



rat WPEHKYQIJCTSETIEKGKRPAVSMSPHLSLPK^^

human_5+3_corrected KAE YQ FWHKPYS D IAEKGKKPEVSMIATTGI^EA'rTLVS W DGQKKTJ^DFDKKPVQfiA.
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rat 

human_5+3_corrected
mouse 5 corrected



^£KHLP YVSLPKTLLKKPR I IGGKAAS^TVPANSD^nTLPC^VGOPLPI IHWT^VSSGX 



rat

human_5+3_corrected
mouse 5 corrected



rat 

human_5+3_corrected
mouse 5 corrected 



RFXKEITVHS<3STVEI*K<3tfi^^ 



rat 

human_5+3_corrected 
mouse 5 corrected 



LSLY^FTKCTTfcSKPfiG^ 

LSIYr^GFYKC^HPGGQnSIJiVKIQYIAAtmi^^^ 






rat 

human_5+3_corrected
mouse 5 corrected 



rat 

human_5+3_corrected
mouse 5 corrected



PQP$VHWVL YDGTELKPLQLTES RETL YPHGTL YI RS IAPSVRGX CIAI £S SGSERKV 
PQPSVYWVLSDGTEVKFIiOmSIXrLFS«GTLYIFiaA$S 



VMLTWraRVTSPRIEAASQWTEVI^ 



rat

human_5+3_corrected
mouse 5 corrected



GSWTHVYPNGSLFIC^VTEKDSGVYI/SV^ dhkqyfrkq 






rat 

human_5+3_corrected
mouse 5 corrected



rat 

human_5+3_corrected
mouse 5 corrected



VI£GKDTCr?tCXASGSPVPEV^ 
ViaOTF0yTX33tSGSPVPEISWSLP!X^^ 



rat 

human_5+3_corrected
mouse 5 corrected



rat 

human_5+3_corrected
mouse 5 corrected 



VVSKPFLINGLYTNRTVII3mV^^ 



rat 

human_5+3_corrected
mouse 5 corrected 



VTVHPKGTIJKKRNrBI*SI>3ADFTCVVt^ 
ITVHKN<ni^IFlJVRI£I>SADFICVARNEG^ 



rat 

human_5+3_corrected
mouse 5 corrected 



QAGKPVMlKSVDGNPPPEnm 
QIX^CSTAI^CSVDC^PPPEIIWILPNGT^ 



rat 

human_5+3_corrected
mouse 5 corrected 



KGftARNKV G Y I RKLVTLE T GQKPVI LT YAP(^Vf^ ISGBSI^IJICVS DG I P KPK1 K5TTM P 
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rat sgmncDftEflvpomiigsw^ 
human_5+3_corrected $GWY&^W<&rn^^

mouse_5_corrected ---

rat IIHYLPBKNLRKTGB^QIiHC^

human_5+3_corrected im*?F$IVTI$<33ttFQl£CVAI^^

mouse_5_corrected ---

rat LVl^JNLQTSDSGVYKCRAQNIXGTDYATTYICV

human_5+3_corrected lviqnpc^sc^giykctaknplgsdyaatyiqv
mouse_5_corrected ---

(rat: SEQ ID NO: 10) 
(human_5+3_corrected: SEQ ID NO: 11) 
(mouse_5_corrected: SEQ ID NO: 12) 
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Figure 16 
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



[Alignment row labels: rat, human_5+3_corrected; sequence rows not legible]



*:*:** **««::.:**«.***. j******.*^,*^********^.*^;™;; 

«.»..*.:**,*♦*****♦** ,.****«. i^.*^;,?l77 

SAFISPQGFMAPFGSLTIJWDQS(S?EA>WC3IQKPSRTSPIAfTEENt>YIV^ 
** ***.*. ***.*.**•«*«♦.**♦ 

***.:»*.**«*«**:**«•*«««»*»»*_ *:***.. j*****^*** 

*:«»«« « : «;:* *«*******,******.*.****.*******_ . 

.*«*. ***.*♦*. ,«.*** ,***,**, ,«.*»«««m««t.»fm„ < » M ,M* 

*****,*,*»**♦,* **«,****♦*.*,* *,.!,-*..*** ♦«♦«,..,..,. 

■ : "* * : ****•*.*• • **•*.. s* »*** «*.«*:.♦.. s:ss 
BML tHRRRGDSTLRRFREHRRQIf LSARRI DPQRHAALLRKAKKNSVPKKQENTTVKPV 
• * ' •****«,*«,,* ***«****,**♦****«»*«. ^.^^^^ 

* ::..**.*:***:. *** **** +* 

E^-TVNFCnXQ$EHI^FKIJSV^^ sv 

*.* .**.* * .**** Zm . **::*.;**;;-*.*.****« «** 

TEFQDS D^GR(3lEtlFQSI^ PITVRTHIKIJVNVKMIiS STTNKL— IJ4ES YNTTNSfl- -QT 
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rat 

human 5+3 corrected 



PITGV$RPSSSDIS8HTTADP$FSSHPSGSH1^^ 
SVRE^SEPRHHflFTSHTTQIlOTSTFPSOPH^^ 



rat 

human 5+3 corrected



* ***. *• ♦*****♦+*« 



* - ** • 






* . * * ** * 



rat 

human 5+3 corrected 



TAALSVPSSSHSRLPlCrmVGVIIffiESTrVVKKP^ 
TAALSFPSA^ITFPKADIARVPSEESTTLVCOT 

***+*,**•: . * }*****:*:;****:::* ** * 



rat 

human_5+3_corrected



JffiVIPTEASHTeAPTSVSli^ 

>s * ** * ** ****!.. *. * : . : 2J s *** *{.*..., t< * : * 



rat 

human 5+3 corrected 



TSTKFSKRKTPLHQIFVNNQKKEGM^ 

AITItf SRFXIPtfQCJTCVNtraNP?^^ PFHFT 

j * : ** : «* * •* • tr * ; * JJ,, *** :* ** **** . * 



rat 

human 5+3 corrected 



TI^TSVMQIPSNTLTEAHHTTTKTHNP^ 

** ** * ♦•-•til ** ** - *. ** *-**> *** * * 



rat 

human 5+3 corrected 



,*:** .**** *** **; . * • * 5* * ; 



rat 

human 5+3 corrected 



ALTTAOTFl^SHSPRQDDG<^ 

&MTP — PALAFTBSPFEKTTGISSTISFHSRTL^^ 

*♦* ***.*** 4 +111*** * * ** - **** * * **** 



rat 

human 5+3 corrected 



LSKSQESTTvmS-OTP-PFXJ^SGAPFVPTO^ 

SSKSHQSTTTRKA5LDTPIPPFLSS SATI^FVPIS PPFTQI^VTDTRGDSHFKIMTNTVV 



** + « 4 J** *** **j***„*_ ;*«* .****; 



rat 

human 5+3 corrected 



TiraSS!U^DLQQPSA^PNPEIIT<^ 

KLHKSSRHN — LCMPSSQLEP LTSSTSNIJ^HSTPMPALTTVKSQNSKLTPSPW 

* * +**• **.*** . * **** **** 



rat 

human 5+3 corrected 



AEYQETOKP YS DIAEW33^PEVSMLATTGIiSEATTLV^ EWDGQIQfTKKS DFDKKPVQZAT 

*.**; ****;* *** # < * -> ^*-*^^* % -*j **j«|** ***** *; 4 



rat 

human 5+3 corrected 



TSKHLPYV3LP!CniLKKPittI(^ 

TSKLLPFDSLSIOTIFEKPRlVOGKAAS FTIPAN S W^PCEAYGNPLPTIHWTKV'6-GLD 

*** **] ;****J******** J*****^********}*** ******* * ; 



rat 

human 5+3 corrected 



: *.*^* •**, «*******«*+**«*****;****** * j**|**** A*^***.* 



rat 

human 5+3 corrected 



flVKEITVHFGSTVELKraVEGMP*^^ 
RTKEITVH3GST/E1JCCJIREQRP$PTVT^ 

..****** ***'******,** * *** •***********] j }***; . ** ****;;;*« 



rat 

human 5+3 corrected 



SLYDRGFYKCVASNPSGQDSLLVKI 
SITPRGFYKCVaSNtt^OSIiVKI^^ 

* j*************^************.***** *** ;** *** # * ************ 



rat 

human 5+3 corrected 



QPSVHWVLYIX^IJare^THSP^TLYPNG^ 

QPSVYWVLS DGTEVm^FTNSKLFXJTSKGTLYIFHLAS St)RGTYECIATSSTGSERRW 
****.*** <****..****♦*.*-.**. ******* -* * ******************* 



rat 

human 5+3 corrected 



I LTVEEGET I PRI ASQKVTn^VNI/GRKLT J>JC SATG DFKFRI IWPLP S KA VI DQW5 ERMG 

MLTMSlERVTSPRIEAAS^RTBJVMFGDKLLIiN G 

* ********* ** * * «-*v**.***** <•***•*-*«***«*-*. •»* * 
human 5+3 corrected swiflTOiK^EaGSVTEKDSG^ 

— " *" * *********^ ;«****»*)* *****<*******;**** ***„****• ;**«*.*** 

rat ISGKDFQVDCK^SGS PVPEVSWSLPDG^ 

human 5+3 corrected iflGKDFQVDC^GSPV^lSWSI^DGOMlW^ 

~* " ******************* j*******^ J**. ******. ** : *****|*******;** 

human 5+3 corrected VAEEGDYTCWtfnTiGKDe^^ 

~" — I****** * A***************;** ****** **. *: •**$**********•** 

rat Pim^I^PSmVISFSNIX^FHAK^^ 

human 5+3 corrected PKIF^PSNDfcEGFSIDRmHANGSLT 

"~ *~ *••*****«*..«*** **j***** .*j*,*** **** j***** .*«*****« **♦*. 

rat VSKPnJNGLYANKTVIK^AIRHSKlCnTOU^ 

human 5+3 corrected VSKPPLIl^YTKRTVIKATAYF^^ 

~ ~~ *********♦*,*(*******.*****;****/*;* ****,****.***;**:*< 

human 5+3 corrected TVHKNGTLEllUF/^SDS^^ 

_ *~ *** **>**, **; ******** ** B *^***************** + *********** ; ;** 

rat /USKPVHJTCSVIX^PM^ 

human 5+3 corrected ix^TMJ(CSVIX^PPPEIIWII^<n^^ 

~ *~ **^ ************** ****|** : * : « ** • * B * ** j * j 



rat CAAFNKVCTttEKiaLI^IGG^ PKPKVKWTTPG 

human 5+3 corrected CAAPKKV(OTEKLVIIiElGQKPVTLTTAPCT 

~" *************..*****■******* ** **^ J***.**************;*** *, 

rat GHVIDRPQVTXSK^ILHEMGTLVIKVTTM 
human 5+3 corrected GYWDfcPQINGKYILHDNCT^ 

*,*•****. .«*★*** j******* .«*}*«*****.*****« j j j*j* a ** < ******* 

rat INraRHHmiGEAMQ^ 
human 5+3 corrected Ttm*FWIVTRlX3&&FDIiICtfM^^ 

* **,:; *** *$****«**♦- ***j •*** * ****>^*| ** ** **** 

rat VIQNLQTSDSGVYKCBA3 i ^Ll/^DyMTYIOVi 
human 5+3 corrected VIQNPQTSDSOI71CCTAKNPI/5SDVAATY iqvi 

~~ **** ******;*** *j+ «*•**«•*****] 

(rat: SEQ ID NO: 13) 
(human_5+3_corrected: SEQ ID NO: 14) 
Figure 17 

MQKRGREVSCLLISLTAICLVVTPGSRVCPRRCACYVPTEVHCTFRDLTSIPDGPANVER 
VNmYNSLTI^THNDFSGl^RLEIXMLHSNGIHRVSDKTFSGLQSLQVI.KMSYNKVQnE 
KDTLYGLRSLTPJLHIJ)HNMEF1NPEAFY GLTIXRLVHI^GNRLTKLHPDTFVSLS YLQIF 
KTSFIKXLYLYDNFTSLPKEMVS S MPMJ3SLYLHGNPWTCDCHLKWLS EWMQGNP 
(SEQ ID NO: 15) 

Figure 18 

MKVKGRGITCIXVSFAVICLVATPCK3KACPRRCACYMPTEVHCTFRYL^ 
RINLGYNSLVRLMETDFSGLTKXELLMLHSNGIFrriPDKTFSDLQAD3VLKMSYNKVRK 

LQKDTFYGLRSLTRLHMDHNN1EF1NPEW 

QIFKJSFIKFLYl^DNFLTSLPQEMSYMPDlJ)SLiYLHGNPWTCDCIiLRWLSDWIQPDVI 

KCKKDRSPSSAQQCPLCMWRTSKGKPIJ^SAAAFQCAKPTroSSLKSKSLTIIJEDSSS 

AHSPQGFM APFGSLTLNMTDQS G^ ANMVCSIQKPS RTSPIAFTEEND YTVLNTS FSTFL 

VChTOYGfflQPVWQmALYSDSPLlLERSHLLSETPQLYYKYKQVAPKPEDIFTNIEADLR 

ADPSW1MQDQISLQLNRTATTFSTLQIQYSSDAQITLPRAEMRPVKHKWTMSRDNOT 

LEHTVLVGGTVGLNCPGQGDPTPHVDWLLADGSKVRAPYVSEDGPxILIDKSGKLELQM 

ADSFDTGVYHCISSNYDDADILTYRITVVEPLVEAYQENGIHHTVFIGETLDLPCHSTGIP 

DASISWVIPGNNVLYQSSRDKKX^rWGTLRILQVTPKDQGYYRCVAANPSGVDFLrFQV 

SVKMKGQRPLEHDGETEGSGLDESNPIAHLKEPPGAQLRTSALMEAEVGKHTSSTSKRH 

NYRELTLQRRGDSTHRRFRENRRHFPPSARRIDPQHWAALLEKAKKNAMPDK^ 

SPPPVVTQLPNIPGEEDDSSGMLALIffiEFMWATKAU^LPARTVTADSRTISDSPMTNlN 

YGTEFSPVVNSQHJ'PEEPTDFKlilTAILTTAMSKNINPTMSSQIQGTT 

ATEFQDSDQMGRGREFffQSRPPITWTMIKDVNVKMI^STTNKLLLESVNSHQTSVREV 

SEPRHNHFYSHTTQILSTSTFPSDPHTAAHSQFPIPRNSTVNIPLFRRFGRQRKIGGRGR1IS 

PYRTPVLPJimYSIFRSTTRGSSEKSTTAFSATVLNVTCLSCLPRELTTATAALSFPSAAPI 

TFPKADIARVPSEESTTLVQNPLLIXENKPSVEKTTPT^^ 

SIPMEKTHKVNAS YPR VS STMiAKl^S VITSSL^ G ArTKPPMTn AITRFSRRKJP WQQNFV 
hn^HOTKGRIllNQHKVSLQKSTAVMLPK^^ 

AHHTTTKTHM 5 GSLPTKKEIJPFPPLNPMLPSnSKl)SSTKSnSTQTAlPATTPTFPA 

TQTERSRAQTIQREQEPQKKmTDPMSPDQSSGFTTPTAMTPPALAFTHSPPENTTGISST 

ISFHSRTLNLTDVffiELAQASTQTLKSTIASETTI^SKSHQSTTTRKASIJDWPFLSSSAT 

LIvffVPISPPFTQRAVTDTRGDSIiniLMTNTVVKLHESSRH^QMPSSQLEPLTSSTSNLL 

HSTPMPALTTVKSQNSKLTPSPWAEQFWHKPY5DIAEKGKKPEVSMLATTGLSEATTLV 

SDWDGQKNTKKSDFDKKPVQEATTSKLLPFDSLSRYIFEKPRIVGGKAASFTIPANSDAF 

LPCEAVGNPLPTIHWTRVSGLDLSRGNQNSRVQVLPNGTLSIQRVEIQDRFQYLCSASNL 

FGTDHLHVTLSVVSYPPRILERRTKEITVHSGSTVELKCRAEGRPSPTVTWILANQTVVSE 

SSQGSRQAVVTVDGTLVLHNLSIYDRGFYKCVASNPGGQDSLLVKIQVIAAPPVILEQRR 

QVIVGTWGESLKLPCTAKGTPQPSVYWVLSDGTEVKPLQFTNSKLFLFSNGTLYIRNLAS 
SDRFTYECIATSSTGSERRVVMLTMEERVTSPRffiAASQKRTEVNFGDKLLLNCSATGEP 

KPQIMRLPSKAVVDQGSWfflYPNGSLHGWEKDSGVYLCVARNmGDDLlLMHVSLR 

LKPAKIDHKQYFRKQVLHGKDFQVDCKASGSPVPEISWSLPDGTMINNAMQADDSGHR 

TRRYRLraNGTLYFNKVGVAEEGDYTCYAQNTLGKDEMKVHLTVlTAAPRIRQSNKTN 

KRIKAGDTAAVLDCEVTGDPKPKJQ^LLPSNDMISFSIDRYTFHAJSGSLTINKVKLLDSG 

EYVCVARWSGDDTKMYKLDVVSKPPLINGLYTNRTVnCATAVRIISKKIIFDCRAEGTP 

SPEVMWIMPDNIFLTAPYYGSmTVHKNGTLEIRNVRLSADHCVARNEGGESVLVVQLE 

VLEMLRRPTFRNPRNPFNEKIVAQLGKSTALNCVSVDGNPPPEnWILPNGTRFSNGPQSY 

QYLIASNGSFnSKTTREDAGKYRCAARNKVGYlEKLVILEIGQKPVILTYAPGTYKGISGE 

SLSLHCVSDGIPKPNIKWTMPSGYVVDRPQINGKYII^HDNGTLVIKEATAYDRGNYICKA 

QNSVGHTLrrWVMTVAYPPRrTNRPPRSrVTRTGAAFQLHCVALGWKPEITWEMPDHS 

LIJSTASKERTHGSEQLHLQGTLVIQNPQTSDSGIYKCTAKNPLGSDYAATYIQVI (SEQ ID NO: 16) 



Figure 19 




Signal Peptide 



Figure 20 
Figure 22 
Figure 24 
Figure 26 
Figure 27 





Figure 30 
Figure 32 
Figure 34 
Figure 35 
Figure 38 
Figure 40 
Figure 41 
Figure 45 
Figure 47 



TCTAGA AGTAAAATGATCCT 

AATCATCrrtTCTOTGOTTTGC^ 

ACCTAC(X5AATACaGAGCTATGACGTTAGT^ 

GTAGCIXKXCAGAACWTACTAGATG^ 

CTTAAGAATATGGTTGTCTCT^ 

GGTTAGTOCTAACrCTXjACroCTTGACTCT^ 

ATCACTTGGTGATTAGCAACIXKjATATGOAATAT^^ 

CTCAGCTTrGGCAOCTtXJATTCATTCTO 

ACTGTTTCTAC1TACAAGACGCATTACCTO 

GAAGAGTGAATTAGACATATTCTXiATGMGTAAGCATATAAAGT^ 

TGTTCCATGTATGCTCATGGATGCTATrAT 

AGCTTATGATACTTGTCACAGAGTAAATCTTCCATAAATATCA 

TTTCATAATCCATCAATTAAAAACCm 

CTGCCCTAGGATGGTCTGTATGGTGGGCCTC 

AGGAGCACATCTTCAGAACATCnXjCCTCAA 

ACTCACATGTGCnX2AA<XACAGGGATTAAACAGTX!l^ 

GGAAG<XTGCTAGAGGAACAGGATGTATCAGGATAA 

ATAACCCTCACCACAACAAACAAC^ 

GAACTAAG<XATATGCAGAAAAAGTATTATATGrTCrrcA 

AGTCAAATATGCAGAATATAGACAAAGATGGTTTAT^^ 

CTTGTAGATTAGAGGACACAAAGCAACAACTACAGAGT^ 

TATAATATGAGGACTGTATTTAATAATTCTATTTAAGATAC^ 

ACTAACACACACACITAC^TAGAGAGAATAAAGTGATAGATAaJlll 

TACK^XSATAATITCATATTGTACACCTC^ 

AGGTGCXJIXTIXXXAGGGCGGAAGAGTACATTCGA^ 

CCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAGGG 

TGGAACAACATTATGAGCTAACCAGTACXXra 

AAAAGATGGCCTAATCGGCCATCACTGGAAAGAGAGGC^ 

ATGCCX^AGTACAGGGGAATACCAGGGC^ 

CKKJGTGGGTGGATATGGGGGACTITIXjGTATAGC 

CTAATAAAAAATGGAAAAAAAAAAAAAAAAAAAAAAAAAGOAAGGT^ 

TTCACTOCTATCTCAACITGCAAACAGAAGGGGA 

GATIX>AAGCGTCTTrcMTGT^ATTGCI^^ 

TGTGAAAACmCGGGGTCTATGACATGTITrm 

ATATTTGGCACCACTAO^GCCCCAGC^ 

CACACACACACA(^CACXC^C^^ 

(^TCATCTGCTCACTCACCCA 

CTATTTTTATGT(nAGGTTCCC^ 

AGAATTTGTACKXXX^ACKX^CKX^ 

CACKXK^TA<XAGCACAGGAGGAGCA^ 

CCCGGGTCGATGGCCCG<XCATTCCX: 

CCCTCCTCXXX3AGTCAAGACCCCTAG 

TTOG<XX^AAGGCAGAGGTGAGCGGCTGCTAATXXXXTO 

Exon 1 

AGTCTGGTAGATX}TGGOAGCTGGGGTTC^^ 
GGTTCAGGAAAAGGAAAGTACTGGAGGGGAGTT^^ 

CATCQCTTCAQTGACreAAqf^^ 

CXK^TTTCATAAAGACTGATQACGTGTTT^ 

GGGACGTCXK3AAAGAAGGA GTGATCGATTA GTAC^ 

TAACTAACgi>TACTATTQAGGG AG AA GG AAGAG^TTCCAGCAGCAGGAGCAGCAGCA 
ffl^AATt^GATAAAQGAAAGCTTTGGTTAGTTTC 

GAAQCGCCTCCAGTTCTCT GAAGAGTCAGTC CX^ AfT^TACTAA 
0<TnTTTGCTCCC^ 
AOATTTrarroTCTcnscrrATGT^ 

TATKKXJAACltrrmOTTAAATAC^GTTTGA<K3CI^ 

TTCATrTAOGGCTCrciTCrcrrATAACTAACTCrrAAXK^ 

CAGCTACCTTGTA(XATCTITGTITATCT^ 

ACWAGAGAOTATTCAAAAATATTCAGTGATGTAACAGTGACACnXjTAGGCATAGAAGTA 

ATCATTAGTAAATCTTAATTTGGGTTAAACT^ 

CACT0AGCCTTCGCCACGTGC8GGTTAAAGA^^^ 

CTTGGCCATGCTOOCCATCACTOTGTt^GTAAGCAGAGGGGTGTT^^ 

AGCAOACAGTGTTATGCCT0K!AAAG1^GAGACTCAGCCCTKXJCAGCro 

GTCCTCCCGGTCATTAGTTGGCTCIXlAAA 

AG ACATGCTAGAAAG AAATTTGACCT1 111 I rCTAGTGGGTTATTACAGCTGTAAAAGTAT 
TTTGQAAGGrTAAGCCAAATAAATAAAACACATATTAAATAATACAATQTTACAAAAA^ 
QATCATATAAAGAAGTACATTC^TAAATGCAATGTGAAAAATATATATAATTTTT ATC 
TTACTGGTGCAAAGTTTTCTAAATrOCA(^TGTACTATITT^ 

AAAATGTATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATTTACAAACT 

(BamHI) 

TTAATTOTCCCGATAACATTATTATGATCTCrAATG 

AATGAGAAGCTATGAAGATATGTrTACAATAATAAGCCCATTTAGTGATAAAGTCXiAATG 

GGAAGCTAGCACACACTXKJTTTATAAAGAGAACAOTTTCCTGAGTCTATG<^AGTTTACA 

CTCTAGGGAATAAGAGTTCCTCTTTCTCCAGATTTCACrAGCATTO 

TTCTTOATGATGAGCATTATAAGTGGAATAAGATAGGATC^ 

ATGCCCTt3AACAATCTTTCAGGTCTTItnTrCAQTTC^CTAG 

TTGGGGGATGGTGTTAATTTTTTTGCAGTTCTTATGGAATTCCAA^ 

AAACAAACAAAAAACCTCTGAAACTAGAACTACCAAT0CATTACnXK3GTATGTAACAAAG 

agaaatctocacagaatitatrgctacattgtrcattatt^ 

accaacttaogtagccgtcaaaatatoaacggataaagaaaat^ 

agagtccc^totggocataaaagagtgaaatcatgacatatgcaggaaatggatgcaact 

GGAAATCSAAriXKjGCTAATCAAAACAAGACAGACTCAAAAAGGAAACACXXiTQTAGC^ 
CTCTGACAAACAGAAGCTAGATTTAC^CTTGTAC^TGCGCATGTGTGTTTAGAA 
TAGTTATACACTATTCTAATCTGTGAGTGTGTATAAAGX3CATGCATGTAAAGCAAAAACA 
AGCTAGCTGGGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAQAACGAAGCATA 

GTAACATAGGTtXXAGGATGAAAlXK^TTAATTIXrrATOCrAACT 
GGCACA(X}TTDW^<XAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAAAGT 

TTCGCCACTAACCAAGAAGCCATTIXK^GTTGCTG^ 
CTOCAGTCTGACACTXnXnATAACAA(XAGTKiAGAATACAAAGTTGGCATGA 
rnTTGTGCrATTnTC ATl 11 11 1 1C1 1A CTOTTTTGTTOTTGTGGTTOTTGTIX3TGGTCK3TO 
GCroTGGTTTTCATTIXjTTrcTTTTGAGAGAGAGAAGGAAC^ 
AGCrGK}AAACGATCTGGAAOAAGTTOGGGAAAGAGAAAAATTGTATGGAGCATATTT 

IGAAACAAACAAACAAACAAAAGGITCATTTTGCsCACAAAA^ 
!;AGTTACGACTCTTAAAQAAAATATTCCCAATTATTCC<^GAGTTGCTA 
7TAGGACTTTGCTTGAACTGGC<Xn'ATAACTCTGGTGTGGTGTCT - Artificial 

NotI site 

(SEQ ID NO: 17) 
Figure 48 



CACAGACCTIXXTTCriCT^ 
(^GCAAGCACACrrcTGCTAATGCAAQG<X^ 
GCAGACXXJACACXSTTTXX^ 
OAAACCTXKrrQTQATCrCCATTTCrrCTCT^ 

TTXxrrcAMarrTTCC^^ 

TCATTXKKSAACXCTGTGCT^^ 

CTIXX*TCAGTGGAGTCITTTGTA1^ 

AACTGTTTTITGTTATGATTGCTGCAAT^ 

TCCTTTGlX^XXTOrrTTX^ 

ACX^AAAGTGATATAGATTCATGGCATTCCTCTA^ 

ACTTGGCAGGTGTAAAAGOCTGGAAGCAGTCAOTAGGCAGTTI^ 

CTXK}AACCTTX3GCATT(XCATAGCT 

GGTCTTGCATGCTTTCTTGC^^ 

TAGCAAAAGTATACXITAGAATCTTCATGACAaTCAGGTATTGC 
GAAGAAAGTIXKjTAAATTCTTCTXIACAAATGGAG 

agctaagtcacactcatatgcaagaatttaccaaggcct 

gctitattcatgataaggtcix^ 

aaaggtgtag^taaattgtgactagagtotgaaatcm 

ggcagagctgttttatcrrtactotaaacattac^^ 

tcctctottn'gtgacagatacitctatgta 

ttixnxjtattatataagcctggtottccctttc 

ccttgtgtctgtgt(xttto 

gtctittccac&gattgagggaggccxmcix3ag 

GCTCACTGTTGGTGACTA(XrrC^ 

AAGTGTGGGTCCAGGAAGCAGGTTGTACACXXriXjTCTO 

AAATAAGTTAAOX^TCTCTlXXriX^ 

TGCIXKHXKJTGTGTCTC^ 

TGTCCATTGTTTCTTCAGTTCCTGT^ 

AATCntK3GAATGACTTTTCTAAGAAATGCCACGTG 

TAAACAAGATATATGGTGCGCAGTITATAATCATAATAAGCTTTGAAATAATATATAA 

CATTCTCATTCIKXTTT^ 

AAAGTAGTGATTTATGACCTGCrrTGTT^ 

ATGTTGAAACATGGAACnTGGGGTOACCTO 

TTTOTCTCCAGAATAAATOAGTrTATCAACTrc^ 

<X<XCGTGGAGTCCCAC(^TTCT 

ATGTGATIX^rrO^CTXXAAATAT^^ 

ACTAAGTCTATTGATTTGGAAGATCTCAGTGG 

TCAGGCTAAGCIXIAAAAATATGACAAATGAAATGTCAAAT^ 

AGTGTTAGGGGGTirTAAAGAAACAAATACXjrAClCT 

CTATTCTAGTTTTGAATAATXnTCCCTAGTATACAGCAAT^ 

TTATCITGACCAATCTC^GTGACTT^ 

AGGTCTCATTCAACCACATCTTrATCATT^^ 

ATTCTtXX^GAGCACTGGAATTO^ 

TGTGCTAAA(^GAGAaTTACAAGA(XrnTTTATGTGGA 

TGACAGAATGTCAGAGTGGAGCTCXX^CTXKK^ 

TTGGAACATCACACAMGAAATTCCAAATT^^ 

ACTTTTGACTCrrGAACATGAGCT^ 

GTGAAGGAAAGCTCGCCAACTCCTTTTTATATO 

GACAGTATGCCAGIXXX^GATCTGTC^TATGACTGAAG 

AAGGCAGGAGAAGGCAAGCTACAGTGAAGCCCAGTIX>CTATAAAGC^ 

AGATAAGAAGGGTTTCTAATTTTTAAA i U H 11 1 ATTAG ATA1TTTCTTC ATTTACATTTC 

AAATGCTATCCCAAAAGTCCCCTATACTCCCCCCTCAC^ 

CCATTTCrrGGCCCTGGTTACTTGTC 

ACTATCTGGTCTATTGCAAGAATGGCTGCCGAGCTC 

AGCCAAACAGlXK3AAGGAACTTGGGGAClXnTATGGAAGAAAAGG 
(XXXXXXUTG<3<XUAAGGAACTCC^ 

ACOCnX^KKntn^OAOT^ 

<XraCTCX*CTCATATXnAACAGAtt^ 

GATGGGGGTTAGGGGTGGGGATGGGGGTrATCTOtAAAGC^ 
TOTTOTACm^GCTG<XnAGTC^ 
GTAAAGACITGAAGTTCTGAOKmm^T<XLiG<X^^ 
TACtoAA<KOAAaAOOAO<XXKJAAGACTT(X3G^ 

A ™ A j^ AAGAAAAA iV^^ 

TACTCTAAAACCATTXX^TCCC^^ 

OATA £!j?£!^ TA0A ^ 

OAACCi J i i\l J ATTOAAGCTTTCACACAGAGCClTGTrCITT^ 

TAGCrTAATGATGTTC^GTGAATTAAAMTAACAATGAAGAATAAAGACAACTOTATT^ 

AAGGATICnxXITATATATTTAAAAATCTAAGaTXraTC 

TTTCTATATATGriTACrCTATOGTATGTTAATrAAmTATGCAA 

CAACATAGTATaTAACTTATAAGAAAOTAAAAC^TTCATGAAATTCrraA^ 

CTTACXXnX^GAAACACTGGGTrrGAATAATTCTrATTTrGCT 

AGAATCTAATAACCTACTAAGGOvAACATAGAA(mX3AAATTAAAAAGAGTAGA^ 

QAAGTAATAAGGCAAATAATGAATATTTGCrTTAAATAGTTCTTAATGTATX^ 

GGGlXJlXJATTCltXAGAClTGAClXXATCCAAAATATCCAAAATO 

TOAAA i^ TOTO,rraA AAA^^ 

TIXKTITITATTGGTGTGATAAGGACCATGACCAAAGCAGCTTAT 

(SEQ ID NO: 18) 



Figure 49 
Figure 50 

TAAGCCTITrGCTCCCGTTGGAAGCAAAGAACGTTCCrTCAATCAGGTGAAGGCTCT 

CCTCAGAAGATTTCATGTCTCAGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGA 

AGAGCTCAGGTATAGCCAACTCTTTTGTTAAATACAGTATGAGGCTTAAGTGTACGG 

CAACTCATGTGGTATTCATTTGCGGCTCTCrrCTCITATAACTAACTCrrAAGGTGCA 

TATAGTCTCITCTGriTCCCAGCTACCTTGCACCATCTTTGTTTATCTAATAATAGCA 

AGCTCATCTGCTTTTTAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTA 

ACAGTGACAGTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATATGGGTTAAACT 
CATTCATAACAGCTCCAGGTTGG (SEQ ID NO: 19) 



TTTGGAACCAACCCAGATGCCCCTCAACAGAGAAATGGGCCAGAAAATGTGGTCCA 

TTTATCCAATGGAATACTACTCAACTTATTAAAAACAACGACTTTCATAAAATnTTA 

GGCAAATGNATGGTCTGNAGGATCTTGAGTGAGGTAACCCAATCACAAAAGAACAC 

TCATGGTATGCACTCACTGATAAGTGGCTATTTGTCTATGGAGTGATTTAAAAGGGA 

AGAAGACACATAGCTTTTTGTGTGTATAATATTAAGATGGAAATTTGCCAGTGCTGT 

TTGGCTTATGAGTGAATCTTGTTTCAGTGGATTACCGGAAGAAAATAATAAGTGAAC 

TGTAGGAAGAAGTAGTTAATCAAGGTGACAAAGTATCCTGACACATTGGGAAAAGA 

CCACAGTCCAGGAAACTGAGTCTTAAGGATTCATATTAACTCCAGTTCCCCATGTGC 

AGCrCTGAGACrTTGGCAGATCAGACACTTAACTTCACCAGCTTCCTACACAGAGCA 

GTTACTATCCTTGCCTTCACACATGGAGTGTGCCATTAAGTGCCTGAACATGAGTCT 

GACTTGTTAATAATCTTTAAAATCCAATTGTGTGTAAAGTATGTGACCAAAGAGCAT 

GGTCATC5CTATTAACCTTTGATGTTCTATGGACTCITAATTrTATGGTAGAAATGTCA 

ACAAGCTTGTGGAGGCTGGAAGATACAAGGCTTAAGAGGATGGCCTTTCAGTTTTG 

AAAGTAATTCAGTATGTGTTCTGGCATCCCTTTTCCTAAAGCAATTTAACCCCCCAA 

GTAGGCATAATTTTAATGCrrACTTCATCAGAATATATCTAATTGACTCTTCTAAAAA 

GACTTTGGTATGCATAGGATCTAAATGTAAATGTGATTTACTGACATAATAAATAGG 

AGAAACTGAGCTAGAATAGGTATAAAATATGTGCTCKjCTTTCTAATAGGTCTTATAG 

GTTATATAAGAGGTGGGAAAGGAATATTTGAAACATCTAGAAGTAAAATGATCCTG 

AGTAGCGATCCTGGGAAAATACGTACTCTAACACACTGCAATCATCTCTCTGTGGTT 

TGCTGGAGCTGAGGTCTGGAAGGCTCGACCTTGGTTAGAAATAACCTACCGAATAC 

agagctatgacgttagtctggaaggagctttggaagaatgacaagctgtagctgcc 

CAGAACATACTAGATGCCATATTTCCAAGGCAAGTGTCCACATGCGGACATCTTAAG 

AATATGG1TGTCTCTGCAGTGCTAAGGACCTTGTTCGTGCCACACAGGTCTCCAGGG 

TTAGTGCTAACTCTGACTGCTTGACTCTTTAATTCTCCCTTGATCATTAATGACTAGA 

AATCACTTGGTGATTAGCAACTGGATATGGAATATTACTTAATTTGTACCCAAGCCA 

GGCCACCTCAGCTTTGGCAGCTCCATTCATTCTGTGGAGCCCAGTCACGTGGGTTTG 

AATCAACTGTACTGTTTCTACTTACAAGACGCATTACCTGAGATGAGTCATTTTTCTT 

CACAAGTCTTTTTAGAAGAGTCAATTAGACATATTCTGATGAAGTAAGCATATAAAG 




tgagagcagcatgaatgtgttccatgtatgctcatggatgctattataatgtggaaa 
taaactgactttaaaaaaaaaagctratgatacttgtcac^^ 

aatatcatctgcatttataaattattttcataatccatcaattaaaaaccntagaaa 

ttttgttaacacaaagatccctaggcccctgccctaggatggtctgtatggtgggcc 

tgagagatggagcttaagaacttacttgctccaggacm:acatcttcagaacatctgc 

ctcaaaacatttatcccaaatgctcatcaaaggctcactcacatgtgcttcaaccac 

agggattaaacagtcattttagtcacatttctcaaacggtggaagcctgctagagga 

acaggatgtatcaggataacatccaaccttacaaaaggatgtcataaccctcacca 

caacaaacaacaacgacaacaaacccataaaaattatcacggcaaatgaactaagc 

catatgcagaaaaagtattatatgttctcattgtggggtgtttttccntaatagtcaa 

ATATGCAGAATATAGACAAAGATGGTTTATGCAAGTGGGGATGGCGAAGGATACTT 

GTAGATTAGAGGACACAAAGCAACAACTACAGAGTGAAGTAATCGAGAGACITAAT 

GTATAATATGAGGACTGTATTTAATAATTCTATTTAAGATACACAGCAAACGAGTGT 

ATOTACTAACACACACACITACATAGAGAGAATAAAGTGATAGATACGTTTGTTTT 

ATCTTCATGTAGCTGATAATTTCATATTGTACACCTCAAACATAGATAACCAACAAA 

GAGGAAGAGGATAGGTGCCTCTCCCAGGGCGGAAGAGTACATTCGAAAGTCAGACA 

CCATTGTGTAGATGTACCACATGGAGGAGCTAGAGAAAGTAGCCAAGGAGCTAAAG 

GGATCTGCAACCCTATAGGTGGAACAACATTATGAGCTAACCAGTACCCCGGAGCT 

CTTGACTCTAGCTGCATATATATCAAAAGATGGCCTAATCGGCCATCACTGGAAAGA 

GAGGCCATTGGACTTGCAAACTTTATATGCCCCAGTACAGGGGAATACCAGGGCCA 

AAAAGGGGGAGTGGGTGGGCAGGGGAGTGGGGGTGGGTGGATATGGGGGACTTTT 

GGTATAGCATTGGAAATGTAAATGAGTTAAATACCTAATAAAAAATGGAAAAAAAA 

AAAAAAAAAAAAAAAAAGGAAGGTCAGACACCTCACTTCACTGCTATCTCAACTTG 

CAAACAGAAGGGGAGTCACAAACCCAGGACAAACCACAGTGATTGAAGCGTCTTTG 

AATGTTATTGCTGTTGTTGTTACCACCATCATTAGCATATATTCATTGTGAAAACTTA 

CGGGGTCTATGACATGTTTTTTTATTCAAGTATATC^ 

ACCACTACCAGCCCCAGCCCCCTTTGCCCCGCCCCCAACACACACACACACACACAC 
ACACACACACACACACACACACACACACACACACACACCTTTACCTTCTCCTGGGCA 
TCATCTGCTCACTCACCCACCCAAGCTTAATCCTITTC 

TCCTATTTTTATGTCTAGGTTCCCCCTCCCCCTGTTAGGAGATGGGAGAGGTCACGA 

AAGAAAGGAATTTGTAGCCCTTGAGCCAGCCCGGGCCACAGAGCCTGCCACCAGAC 

AGGAAAAGCCCAGGGCTTACCAGCACAGGAGGAGCAAACTCGCAGGCGAGCCTGG 

GTTGGCGCTGGTGGTCCCGGGTCGATGGCCCGCCCATTCCCAGAAGCCGAGGCTATA 

GCTGCGTCACCTGCCCCGCCCTCCTCCCGAGTGAAGACCCCTAGAGGCTGAGCAGAC 

CCCAAAGGCGGTGCAATTCCATTGGCCCAAGGCAGAGGTGAGCGGCTGCTAATCCC 

CTCGGGAAGTGAAGGGACCCAGAGAGTCTGGTAGATGTGGGAGCTGGGGTTCAGGG 

CGAGACAGAGGGTGGGATGGGCAGAAGGGTCCAGGAAAGGAAAGTACTGGAGGGG 

AGTTGGGACAAAAGCAGCGACCAAGGGAACATCGCTTCAGTGACTGAAGCCAGGCA 

AAAGGAGCGGGAAGGATTATATGTAGCCTGGGACGCTTTCATAAACACTGATGACG 

TGTTTGTGCAAAGCAAGCAATTTGAGGAGAAACXjCCTGGGACGTCGGAAAGAAGGA 
GTGATCGATTAGTACITGTAAGTTTAGGTGAGTTTGAGAACTAACTAACCTATACTA 

TTGAGGGAGAAGGAAGAGCATTCCAGCAGCAGCAGCAGCAGCAGCAATCAGATAA 

AGGAAAGCTTTGGTTAGTTTGGAAATGTATGATACCATTAAAATAACAGAAGCGCCT 

CCAGTTCTCTGAAGAGTCAGTCCCCCAGCTAGTGAAGACTAAGCCTACTAAGCCTTT 

TGCTCCCGTTGGAAGCAAAGAACGTTCCTTCAATCAGGTGAAGGCTCTCCTCAGAAG 

ATTTCCTGTCTCTGCTTATGTTACAAGAGGATTCAAAAGCAAGACAGAAGAGCTCAG 

GTATTGCCAACTCTrTTGTTAAATACAGTTTGAGGCTTAAGTGTACGGGAACTCATG 

TGGTATTCATTTACGGCTCTCITCTOTATAACTAACrOT 

TTCTGTITCCCAGCTACCTTGTACCATCTTTGTTTATCTAATAATAGCAAGCrCATCT 

GCHTITrAATCATCACGCAGAGAGTATTCAAAAATATTCAGTGATGTAACAGTGACA 

GTGTAGGCATAGAAGTAATCATTAGTAAATCTTAATTTGGGTTAAACTCATTCATAA 

CAGCTCCAGGTTGGGAGGGATCACTGAGCCTTCGCCACGTGCGGGTTAAAGATATTT 

TCTAACAAGAGAAGCAGAATTCTTCCTTGGCCATGCTCCCCATCACTGTGTCAGTAA 

GCAGAGGGGTGTTTCCAAGCAGAGAAAGAGCAGACAGTGTTATGCCTGCAAAGTCA 

GAGACTCAGCCCTCCCAGCrGGTCAGTTTACTGTCCTCCCGGTCATTAGTTGGCTCTG 

aaaaggcccatgtgtccttattgck:aaggacttgcagacatgctagaaagaaattt 
gacctttttltctagtgggttattacagcrgtaaaagtattttggaaggttaagcca 
aataaataaaacacatattaaataatacaatgttacaaaaattgatcatataaaga 

AGTACATTCATAAATGCAATGTGAAAAATATATATAATTTTTATCTATlTACTGGTGrC 

AAAGTTTTCTAAATTGCACATGTACTATITrTATATTTATAAAAATAlllllAAAAT^ 

TATATAAAAGTGTAAAAGGCTCTTGGTCAAACAAGAGAGTTAAATITACAAACTTTA 

ATTGTCCCGATAACATTATTATGATCTCTAATGACAGGGATCCTGCTTITCATTGGGA 

AATGAGAAGCTATGAAGATATGTTTACAATAATAAGCCCATTTAGTGATAAAGTCCA 

ATGGGAAGCTAGCACACACTGGTTTATAAAGAGAACAGTTTCCTGAGTCTATGCAA 

G1TTACACTCTAGGGAATAAGAG1TCCTCTTTCTCCAGATTTCACTAGCATIT 

CATCATTTATCTTCTTGATGATGAGCATTATAAGTGGAATAAGATAGGATCTCAAAG 

GAATGTCAATTTGGATGCCCTGAACAATCTTrCAGGTCTTTOT 

TATTCATTTATTGGATAATTGGGGGGATGGTGKjTAATTTITITG^ 

TTCCAAAAAACAAAAAACAAACCAACCAACCAAAAACCTCTGAAACTAGAACTACC 

AATCCATTACTCK3GTATGTAACAAAGAGAAATCTGCACAGAATTTATTGCTACATTG 

TTCATTATTCACGACAGCCAAGAATGTGGAACCAACTTACGTAGCCGTCAAAATATG 

AACGGATAAAGAAAATGTGGAAATGTGTACAACAGAGTCCCATGTGGCCATAAAAG 

AGTGAAATCATGACATATGCAGGAAATGGATGCAACTGGAAATCAATTGGGCTAAT 

CAAAACAAGACAGACTCAAAAAGGAAACACCGTGTAGCTTCTCTGACAAACAGAAG 

CTAGATTTACACTTGTACGTGCGCATGTGTGTTTAGAATTrrAlTTAGTTATACACTA 

TTCTAATCTGTGAGTGTGTATAAAGGCATGCATGTAAAGCAAAAACAAGCTAGCTG 

GGGTGGGTAGGAGAGAAAGCAATGAGAGGAGTTAATAAGAACGAAGCATAGTAAC 

ATAGGTGCCAGGATGAAATGCATTAATTTGTATGCTAACTAAACCACAGACAGGAG 

GCACACGTTCAAACCAGGGTGAAATCCCAGCACAGAGAAGGGGAAGTAGACACAA 

AGTTTCGCCACTAACCAAGAAGCCATTTGCAGTTGCTGCCTGCTGGGAAGGGGCGTT 
ccagttttctccagtctgacactgtgtataacaaccagttgacaatacaaagttggc 

atgatggatggtitttgtgcratttttcatttttm 

ggttgttgtggtggtggctgtggttttcatttgtttcttttgagagagagaag^ 

atgaaattgggtgggtaggaagctggaaacgatctggaagaagttggggaaagag 

aaaaattgtatggagcatatttaaacaaacaaacaaacaaacaaaaggttcatttt 

gccacaaaaaggtgtgaattaaattaaccagttacgactcttaaagaaaatattccc 

aattattcccagagitgctatgtatgctgtgcctaggacttigcttgaactggcccta 

taactctggtgtggtgtcttttcaggatgcagaagagaggcagggaagtcagctgct 

tgctgatctccctcactgccatctgcctggtggtcacccctgggagcagggtctgtc 

ctcgccgatgtgcctgctatgtgcccacagachjtgcactgtacatttcgggacctga 

cctccatcccagacgkjgkiiatcccacrccaatgtckjaacgagtcaatttagggtgtgtg 

gaccttgcctgatctccitctcagagagggaccactgattttcctggtacm 

ccaaacacctgtgattactitraatagttttcrtctaaaatgggttcatac 

atattgtggagacaatgaacattttatcccaatagtcttttactagaacttgaagcc 

cctctragttgtttgggagcctcataattatggggcagctttattct 

aaatgaaaaagatacagtttctgttaacaatcattatgataccaaggaagaggaat 

tgtcattgaatattttaaaaaagcatttcttttgcaatttataaataccca™ 

atggcttacttaaaatacitgccttactaaatctgacaaattatggtgatattttgaa 

ggtttatgaaaatttgtttatgtgtataaatgcacaagaaatgggatatgccatcac 

ctatgtgccattagtgagcatgtacagtatgccaaacactattgttcacgtttggag 

gaagtaatgggggtgggggagcaacaagggttataaccgtatacccagtgccttgg 

aagcgattgcaaacagtaaagactgacattgtgttctccctatgagggaggggcctt 

gggctgagcactttgcaatgagcatttgctcattgtgctggcaggtntatgataac 

ttgacccaagctagagtcactggagaggaaggaacttcaactgagaacatgcctga 

agaagatcagattataggcaggcctgtggggcattttcttaattagtgattcatggg 

gcagggcccagtccattgttcgtggtaccatttctcaggcactattaaaaaaaaaaa 

aacaggctgagcaagtgtcaaggagcaagtcagtgagcagcagccctaatgatctc 

tgcatcagctcctgcctccaggttcctaccctatttgagttcctgtcctagctcccta 

cagtgatgaacaatgatgtggaagtataagccaaataaatcctttcttccccaactt 

gctgttggtcatgatgtttcatcacagtgataatagtcctcatgaagatgctggtgt 

ttataacacctttggactaaattctgttatctatagctgaggaaaatggagcataga 

aagtctccagactacaccagagtgtaatctgggcctgagcttagaatcacacccac 

gtgcactccactgccggggcttcitaaccggaacacagttgtaaaagggaattttct 

gtttgtttccattttgacatgtggacittaattgacgattcatctgaagctgaaaatg 

A l 'll'l'rri'l'CCAGGTATAACAGCCTCACTAGATTGACAGAAAATGACTnTCTGGCCT 

GAGCAGACTGGAGTTACTCATGCTGCACAGCAATGGCATTCACAGAGTCAGTGACA 

AGACCTTCTCGGGCTTGCAGTCCTTGCAGGTGAGATAGGTAGAGGGTGATGGAGGC 

TGAGAAGAGAGGTGCAACTGTGGGTTATACCCAAAAGCTGCTGATTCCCGTGGGAG 

ACATTCTATAAGCATTCTATAAACTAGAGGCAGATATCAAGGAAGGATTTCAATTGT 

AATGCAATTTTATGAGAAAATTTGAATATTAAGAAAATGCTGGGGAAAATGCTTAC 
acaattgcgaggacctaatttaggatctccaatagccacataaaaagcacagcatg 

gcggcagacacctgcaattcctgtccctggaagcacctgttcagaatcccagagact 

cattggccaaacactctattcaatcaatgaagtccatattcagtgacaaaacttgac 

tcagaaactaatgtggaaagcatcaggaagacagccaacatctggtctctactcat 

gcatgaataagggatcccagagagaagggaagaaaaaggaaggaaggaaggaag 

gaaggaaggaaggaaggaaggaaggaaggaaggaagagagggaggaaaggaggg 

agggaaggaaggaaagggaaaggaaaaaagagatggggagggagggaaggaaag 

gaaagggggagaaagaagagaagaaaggaaaataaataaattttcagggattatt 

acacctttaaattttatccataaaaggtcat1tccacctgtttgtctggaagtagagt 

gggatcccttatataagggcagtctitaacatagtagcattttataaaccattacaa 

attttgagttttctctactititatcctct 

cccacaaatgaagaaaatgctgtaagagttttcacacaccgaagtgggaaacttaa 

ggattagacaagtctaacaatgagaatggggagaacaaaaagagactgcacaggg 

agccctitctctgcttataatcttgacacntgagaagctaattgacgct 

ctcaactctttaagcaaacaatgctgttgttcatgaaaagcacaataaagtacatat 

gtcccataatattcatcaaaatttgcatgcagcacataatagcaatcaaagcaataa 

cacccactgttcacagagactttaaacatgaaacTggaactatgtctagtgttttga 

cttagggtacatagtatgctgtgtctgtatgtaccaatgttgatttaggtcatcaga 

cagcatttggaacatgtatcttcaggaggaatcattcatgtatcctgcatgaaattc 

tccacctatgtttattctctr^ 

aggttttactcccaggtaacatttagggaaaagctgtctatgttctcagtttggctt^ 

tatttatgagggatgttggtattccagaaaattctcttttgaagagattacaattta 

ggtcaaaacagaaaaatatgtaaaaagttattgtttttattagtatttcatgttcttt 

t<mtitraaaaatggtatgctragaactaattaagattagattagattagattaga 

aaataatcagagagggatttgatgaatgctaaagcatcatgaaaaattcaaaattt 

tttgcttctaattcagaatcaattaaattcatattactataaaagacagcacgccag 

atgtgtgccagctgaggagtggataaactgtgtaacgtgagtgctatgtagaaaca 

gaaaggagtgaagggttgatgtgcgctgcaacatcttgaaaacattcggctacatg 

atggaagccagk3cacaaaaagccacatattgcatggttatgtttatatgaaatgttt 

aaaatacatggattcttagcaaacagagtaagatgttacttagggtcaggaaaaga 

ttaaaaaaaaaaaaactattgatgtggaatgatcttaatttggggaaaagacaatt 

tcctaagacgaaatagttgaggtagatatagttatatccctgtggatattgtaataa 

accagcatgctgtgctctgagaagggcctaatgaaggggcaggaggaagtgaaatg 

agatggtagaaaggaaagtcatataccatggcttctctcgtgggtggaatctagat 

atgttaatatattgacataaaggaaggaattgtttagggaaggatcaaaaccaaca 

ggagtgagggagacaataggaaccaatgagaggcaaagttcatggtcaatgtgtgt 

ggagacaccataataaaactccitititgtttgcraacraaaaccactaaaatctaa 

aaacaaaacatttttgcacaagaattatttattattcaataaagatgtttaaatggg 

ggaagttgaagttcattgatagtctcataaatcttaaatgtatttaaactgcttttta 

cgttttttattattaattactcttgctgtcattattatcatcatcattatcgtcatcat 





CATCACTAATGCTTTTCACCATACACAAATGTAGGCAGAAGAGTGTAATCCACTTAG 

TGAGGCAATCTTGGAGAGGGAAAGGAAGCGGATGCGGGGCAGAGGCACACAGGAG 

GACAGTGAGAGGGAAATGAACAAGAAAAAATGTGGACACATGCACAAAAATTCCA 

TAGTCCACTACATTACTTTGTATTCTAATATTAAGAAAATAATAAACCCATTTCTGTG 

CACTTATCACCCAGCO'CAACAGTTATCTTCrGCCACAGATCCTGTCTCACTGCATCCT 

GTCCACCTGAGTCCAOTAGCGTTCTGAATCCAATCCAGGGCATGATGCTTACTCCT 

ACACAGAACTAAAGATTAAAGAGAGTTTAAAAGTAACCATGACATCTCTCTGTTCCT 

TTAGCGATAAGTTCTTAATATTTATGGCTGCITGTGTATGTTCTAATTTCTCT 

GTCACATTTAGTTGGCAACTACTTTGTTTGAATTGAGTTGGAGTTAAGGTCCCATAG 

GATTAATCTCAACATATTTCTATATTTATAAACTllTCTCTCnTGTGAAAGTTCCT^ 

GAGAAAACAAATATGCCCATATCTTTCITrACACKJTCTTAAAAATGAGCTATAACAA 

AGTCCAAATAATTGAGAAGGATACTITGTATCKjACTCAGGAGCTTGACCCGKjTTGCA 

cctggatcacaacaacattgagtttatcaaccccgaggcgttttacggactcacctt 

gctccgcttggtacatctagaaggaaaccggctgacaaagctccatccagacacat 

ttgtctcitrgagctatctccagatatttaaaacctccttcattaagnacctgtac^ 

gtatgataacttcattgacctccctcccaaaagaaatggtctcctctatgccaaacc 

tagaaagcctttacttgcatggaaacccatggacctgtgactgccatttaaagtggt 

tgtccgagtggatgcagggaaacccaggtaactatcttgtttgtttgtttcl i'll 1 1 i 

atarkacgtattttcctcaatttcatttagaatgatatcccaaaagtcccccataacc 

tcccccccacttccctacctacccattcccattttttggccctggcattcccctg 

ggggcatataaagtttgcgtgtccaatggacctctctttccagtgatggccaactag 

gccatcttttgatacatatgcagctagagtcaagagctctggggtactggttagttc 

ataatgttgttgcacctacagggttgaa (SEQ ID NO: 20) 



Figure 51 

MPKRAHWGAl^VVLIIXWGIffRVALACPHPCACYVPSEVHCTFRSLASVPAGIARHVE 
RINLGFNSIQALSETSFAGLTKLELLMIHGNEIPSIPDGALRDLSSLQVFKFSYNKLRVITG 
QlLQGl^NLMRLHlDHNKiEFlHPQAFNGLTSLRLLHLEGNLLHQLHPSTFSIFlTLDYl 7 R 
I^TIPJILYLAENMVRTLPASMIitfMPLLENLY^ 

KCKKDKAYEGGQLCAMCFSPKKLYKHEIHKLKDMTCLKPSIESPLRQNRSRSffiEEQEQ 
EEDGGSQLDJiKFQLPQWSISU^TDEHG 

ATVALDHiCPMTPJiNYEKiWKLIAYYSEVPVKLHRELMLSKDPRVSYQYRQDADEEAL 
YYTGVRAQILAEPEWVMQPSroiQLNRRQSTAKKVLLSYYTQYSQTISTKDTRQARGRS 
WVMffiPSGAVQRDQTVLEGGPCQI^CNVKASESPSIFWVLPDGSILKAPMDDPDSKFSIL 
SSGWLRIKSMEPSDSGLYQCIAQVRDEMDRMVYRVLVQSPSTQPAEKDTVTIGKNPGES 
WLPCNALAIPEAHI^WILPNRRIINDLAOTSHVYMLPNGTLSIPKVQVSDSGYYRCVAV 
NQQGADHFTVGITVTKKGSGIJPSKRGRRPGAKAI^RVREDrvnEDEGGSGMGDEEOTSR 
RLLHPKDQEVF1.KTKDDAFNGDKKAKKGRRKLK1.WKHSEKEPETNVAEGRRVFESRRR 
INMANKQINPERWADILAKWGKmPKGTEWPLIKTTSPPSI^IJEVTPPFPAVSP 
VQTVTSAEESSADWLLGEEEHVLGTISSASMGIJEHI^GVILVEPEVTSTPIJEEVVDDL 
SEKTEEITSTEGDLKGTAAPTLISEPYEPSPTLHTLDTVYEKPTHEETATEGWSAADVGSS 
PEPTSSEYEPPLDAVSLAESEPMQYFDPDLETKSQPDEDKMKEDTFAHLTPTPTIWVNDS 
STSQDFEDSTIGEPGVPGQSHLQGLTDNIHLVKSSLSTQDTLLIKKGMKEMSQTLQGGNM 
I^GDPTHSRSSESEGQESKSITLPDSTLGMSSMSPVKKTAETTVGTLLDKDTTTVTTTPR 
QKVAPSSTMSTHPSPJIRPNGRRRIJRPNIOT^ 

QVESSLVPTAWVDNTVNTPKQLEMEKNAEPTSKGTPRRKHGKRPNKHRYTPSTVSSRA 
SGSKPSPSPENKHRNTVTPSSETILLPRTVSLKTEGPYDSII)YMTTTR^ 
WYKPTSDGKEIKDDVATNVDKHKSDILVTGESITNAIPTSRSLVSTMGEFKEESSPVGFP 
GTPTWNPSRTAQPGRLQTDIPVTTSGENLTDPPLLKELEDVDFTSEFLSSLTVSTPFHQEE 
AGSSTTI^SIKVEVASSQAETTTLDQDHI^TTVAILLSETRPQNHTPTAARMKEPASSSPS 
T1LMSLGQTTTTKPALPSPRISQASRDSKENTVFLNYVGNPETEATPVNNEGTQHM 
I^TPSSDRDAI'NLSTKLELEKQVFGSRSLPRGPDSQRQDGRVHASHQLTRVPAKPILPTA 
TVRIJEMSTQSASRYF^SQSPRHWTNKPEITTYPSGALPE 

SKPSIPSKFTDRRTDQFNGYSKVFGNNNIPEARNPVGKPPSPRIPHYSNGRLPFFTNKTLSF 
PQLGVTRRPQIPTSPAPVMRERKVIPGSYNR1HSHSTFHLDFGPPAPPLLHTPQ1TGSPSTN 
LQNIPMVSSTQSSISFITSSVQSSGSFHQSSSKFFAGGPPASKFWSLGEKPQILTKSPQTVSV 
TAETDTVFPCEATGKPKPFVTWTKVSTGALMTPNTRIQRFEVLKNGTLVIRKVQVQDRG 
QYMCTASNLHGLDRMVVLLSVTVQQPQILASHYQDVTVYLGDTIAMECLAKGTPAPQI 
SWnTDRRVWQTVSPVESRITLHENRTLSIKEASFSDRGVYKCVASNAAGADSLAIRLHV 
AALPPVfflQEKLENISLPPGLSIHIHCTAKAAPLPSVRWVLGDGTQIRPSQFLHGNLFVFP 
NGTLYIRNLAPKDSGRYECVAANLVGSARRTVQLNVQRAAANARITGTSPRRTDVRYG 
GTLKLDCSASGDPWPRELWRLPSKRMEDALFSFDSRIKVFANGTLVVKSVTDK 
DAGDYLCVAR^VGDDYVVLKVDVVMKPAKIEHKEENDHKVFYGGDLKVDCVATGL 

PNPEISWSIJ'DGSLVNSFMQSDDSGGRTKIIYVVI^GTLYFNEVGMREEGDYTCFAEN 

QVGKDEMRVRVKVVTAPATIRNKTYLAVQWYGDVWVACEAKGEPMPKVTWLSPTN 

KVIPTSSEKYQIYQDGTLLIQKAQRSDSGNYTCLVRNSAGEDRKTVWIHVNVQPPKING 

NPWITTVl^JAAGGSRKLIDCKAEGIFrPRVLWAFPEGVVLPAPYYGNRITVHGNGSLDI 

RSIJlKSDSVQLVCMAPvNEGGEARLIVQLTVIJEPMEKPIFBDPISE^^ 

AAGTPTPSLVWVLPNGTDLQSGQQLQRFYHKADGMLfflSGLSSVDAGAYRCVARNAA 

GHTERLVSLKVGLKPEANKQYHNLVSIINGETLKIJCTPPGAGQGRFSWTLPNGMHLEG 

PQTLGRVSLLDNGTLTVREASVFDRGTYVCPJvlETEYGPSVTSffVrVIAYPPRITSEPTPVI 

YTRPGNTVKL^CMAMGIPKADITVVELPDKSHLKAGVQARLYGNRPLIiPQGSLTIQHAT 

QRDAGFYKCMAKNILGSDSKTTYIHVF (SEQ ID NO: 21) 
ATGCCCAAGC GCGCGCACTG GGGGGCCCTC TCCGTGGTGC TGATCCTGCT 
TTGGGGCCAT CCGCGAGTGG CGCTGGCCTG CCCGCATCCT TGTGCCTGCT 
ACGTCCCCAG CGAGGTCCAC TGCACGTTCC GATCCCTGGC TTCCGTGCCC 
GCTGGCATTG CTAGACACGT GGAAAGAATC AATTTGGGGT TTAATAGCAT 
ACAGGCCCTG TCAGAAACCT CATTTGCAGG ACTGACCAAG TTGGAGCTAC 
TTATGATTCA CGGCAATGAG ATCCCAAGCA TCCCCGATGG AGCTTTAAGA 
GACCTCAGCT CTCTTCAGGT TTTCAAGTTC AGCTACAACA AGCTGAGAGT 
GATCACAGGA CAGACCCTCC AGGGTCTCTC TAACTTAATG AGGCTGCACA 
TTGACCACAA CAAGATCGAG TTTATCCACC CTCAAGCTTT CAACGGCTTA 
ACGTCTCTGA GGCTACTCCA TTTGGAAGGA AATCTCCTCC ACCAGCTGCA 
CCCCAGCACC TTCTCCACGT TCACATTTTT GGATTATTTC AGACTCTCCA 
CCATAAGGCA CCTCTACTTA GCAGAGAACA TGGTTAGAAC TCTTCCTGCC 
AGCATGCTTC GGAACATGCC GCTTCTGGAG AATCTTTACT TGCAGGGAAA 
TCCGTGGACC TGCGATTGTG AGATGAGATG GTTTTTGGAA TGGGATGCAA 
AATCCAGAGG AATTCTGAAG TGTAAAAAGG ACAAAGCTTA TGAAGGCGGT 
CAGTTGTGTG CAATGTGCTT CAGTCCAAAG AAGTTGTACA AACATGAGAT 
ACACAAGCTG AAGGACATGA CTTGTCTGAA GCCTTCAATA GAGTCCCCTC 

TGAGACAGAA CAGGAGCAGG AGTATTGAGG AGGAGCAAGA ACAGGAAGAG 
GATGGTGGCA GCCAGCTCAT CCTGGAGAAA TTCCAACTGC CCCAGTGGAG 
CATCTCTTTG AATATGACCG ACGAGCACGG GAACATGGTG AACTTGGTCT 
GTGACATCAA GAAACCAATG GATGTGTACA AGATTCACTT GAACCAAACG 
GATCCTCCAG ATATTGACAT AAATGCAACA GTTGCCTTGG ACTTTGAGTG 
TCCAATGACC CGAGAAAACT ATGAAAAGCT ATGGAAATTG ATAGCATACT 
ACAGTGAAGT TCCCGTGAAG CTACACAGAG AGCTCATGCT CAGCAAAGAC 
CCCAGAGTCA GCTACCAGTA CAGGCAGGAT GCTGATGAGG AAGCTCTTTA 
CTACACAGGT GTGAGAGCCC AGATTCTTGC AGAACCAGAA TGGGTCATGC 
AGCCATCCAT AGATATCCAG CTGAACCGAC GTCAGAGTAC GGCCAAGAAG 
GTGCTACTTT CCTACTACAC CCAGTATTCT CAAACAATAT CCACCAAAGA 
TACAAGGCAG GCTCGGGGCA GAAGCTGGGT AATGATTGAG CCTAGTGGAG 
CTGTGCAAAG AGATCAGACT GTCCTGGAAG GGGGTCCATG CCAGTTGAGC 
TGCAACGTGA AAGCTTCTGA GAGTCCATCT ATCTTCTGGG TGCTTCCAGA 
TGGCTCCATC CTGAAAGCGC CCATGGATGA CCCAGACAGC AAGTTCTCCA 
TTCTCAGCAG TGGCTGGCTG AGGATCAAGT CCATGGAGCC ATCTGACTCA 
GGCTTGTACC AGTGCATTGC TCAAGTGAGG GATGAAATGG ACCGCATGGT 
ATATAGGGTA CTTGTGCAGT CTCCCTCCAC TCAGCCAGCC GAGAAAGACA 
CAGTGACAAT TGGCAAGAAC CCAGGGGAGT CGGTGACATT GCCTTGCAAT 
GCTTTAGCAA TACCCGAAGC CCACCTTAGC TGGATTCTTC CAAACAGAAG 
GATAATTAAT GATTTGGCTA ACACATCACA TGTATACATG TTGCCAAATG 
GAACTCTTTC CATCCCAAAG GTCCAAGTCA GTGATAGTGG 
TTACTACAGA TGTGTGGCTG TCAACCAGCA AGGGGCAGAC CATTTTACGG 
TGGGAATCAC AGTGACCAAG AAAGGGTCTG GCTTGCCATC CAAAAGAGGC 
AGACGCCCAG GTGCAAAGGC TCTTTCCAGA GTCAGAGAAG ACATCGTGGA 
GGATGAAGGG GGCTCGGGCA TGGGAGATGA AGAGAACACT TCAAGGAGAC 
TTCTGCATCC AAAGGACCAA GAGGTGTTCC TCAAAACAAA GGATGATGCC 
ATCAATGGAG ACAAGAAAGC CAAGAAAGGG AGAAGAAAGC TGAAACTCTG 
GAAGCATTCG GAAAAAGAAC CAGAGACCAA TGTTGCAGAA GGTCGCAGAG 
TGTTTGAATC TAGACGAAGG ATAAACATGG CAAACAAACA GATTAATCCG 
GAGCGCTGGG CTGATATTTT AGCCAAAGTC CGTGGGAAAA ATCTCCCTAA 
GGGCACAGAA GTACCCCCGT TGATTAAAAC CACAAGTCCT CCATCCTTGA 
GCCTAGAAGT CACACCACCT TTTCCTGCTG TTTCTCCCCC CTCAGCATCT 
CCTGTGCAGA CAGTAACCAG TGCTGAAGAA TCCTCAGCAG ATGTACCTCT 
ACTTGGTGAA GAAGAGCACG TTTTGGGTAC CATTTCCTCA GCCAGCATGG 
GGCTAGAACA CAACCACAAT GGAGTTATTC TTGTTGAACC TGAAGTAACA 
AGCACACCTC TGGAGGAAGT TGTTGATGAC CTTTCTGAGA AGACTGAGGA 
GATAACTTCC ACTGAAGGAG ACCTGAAGGG GACAGCAGCC CCTACACTTA 
TATCTGAGCC TTATGAACCA TCTCCTACTC TGCACACATT AGACACAGTC 
TATGAAAAGC CCACCCATGA AGAGACGGCA ACAGAGGGTT GGTCTGCAGC 
AGATGTTGGA TCGTCACCAG AGCCCACATC CAGTGAGTAT GAGCCTCCAT 
TGGATGCTGT CTCCTTGGCT GAGTCTGAGC CCATGCAATA CTTTGACCCA 
GATTTGGAGA CTAAGTCACA ACCAGATGAG GATAAGATGA AAGAAGACAC 
CTTTGCACAC CTTACTCCAA CCCCCACCAT CTGGGTTAAT GACTCCAGTA 
CATCACAGTT ATTTGAGGAT TCTACTATAG GGGAACCAGG TGTCCCAGGC 
CAATCACATC TACAAGGACT GACAGACAAC ATCCACCTTG TGAAAAGTAG 
TCTAAGCACT CAAGACACCT TACTGATTAA AAAGGGTATG AAAGAGATGT 
CTCAGACACT ACAGGGAGGA AATATGCTAG AGGGAGACCC CACACACTCC 
AGAAGTTCTG AGAGTGAGGG CCAAGAGAGC AAATCCATCA CTTTGCCTGA 
CTCCACACTG GGTATAATGA GCAGTATGTC TCCAGTTAAG AAGCCTGCGG 
AAACCACAGT TGGTACCCTC CTAGACAAAG ACACCACAAC AGTAACAACA 
ACACCAAGGC AAAAAGTTGC TCCGTCATCC ACCATGAGCA CTCACCCTTC 
TCGAAGGAGA CCCAACGGGA GAAGGAGATT ACGCCCCAAC AAATTCCGCC 
ACCGGCACAA GCAAACCCCA CCCACAACTT TTGCCCCATC AGAGACTTTT 
TCTACTCAAC CAACTCAAGC ACCTGACATT AAGATTTCAA GTCAAGTGGA 
GAGTTCTCTG GTTCCTACAG CTTGGGTGGA TAACACAGTT AATACCCCCA
AACAGTTGGA AATGGAGAAG AATGCAGAAC CCACATCCAA GGGAACACCA 
CGGAGAAAAC ACGGGAAGAG GCCAAACAAA CATCGATATA CCCCTTCTAC 
AGTGAGCTCA AGAGCGTCCG GATCCAAGCC CAGCCCTTCT CCAGAAAATA 
AACATAGAAA CATTGTTACT CCCAGTTCAG AAACTATACT TTTGCCTAGA 
ACTGTTTCTC TGAAAACTGA GGGCCCTTAT GATTCCTTAG ATTACATGAC 
AACCACCAGA AAAATATATT CATCTTACCC TAAAGTCCAA GAGACACTTC 
CAGTCACATA 
TAAACCCACA TCAGATGGAA AAGAAATTAA GGATGATGTT GCCACAAATG
TTGACAAACA TAAAAGTGAC ATTTTAGTCA CTGGTGAATC AATTACTAAT 
GCCATACCAA CTTCTCGCTC CTTGGTCTCC ACTATGGGAG AATTTAAGGA 
AGAATCCTCT CCTGTAGGCT TTCCAGGAAC TCCAACCTGG AATCCCTCAA 
GGACGGCCCA GCCTGGGAGG CTACAGACAG ACATACCTGT TACCACTTCT 
GGGGAAAATC TTACAGACCC TCCCCTTCTT AAAGAGCTTG AGGATGTGGA 
TTTCACTTCC GAGTTTTTGT CCTCTTTGAC AGTCTCCACA CCATTTCACC 
AGGAAGAAGC TGGTTCTTCC ACAACTCTCT CAAGCATAAA AGTGGAGGTG 
GCTTCAAGTC AGGCAGAAAC CACCACCCTT GATCAAGATC ATCTTGAAAC 
CACTGTGGCT ATTCTCCTTT CTGAAACTAG ACCACAGAAT CACACCCCTA 
CTGCTGCCCG GATGAAGGAG CCAGCATCCT CGTCCCCATC CACAATTCTC 
ATGTCTTTGG GACAAACCAC CACCACTAAG CCAGCACTTC CCAGTCCAAG 
AATATCTCAA GCATCTAGAG ATTCCAAGGA AAATGTTTTC TTGAATTATG 
TGGGGAATCC AGAAACAGAA GCAACCCCAG TCAACAATGA AGGAACACAG 
CATATGTCAG GGCCAAATGA ATTATCAACA CCCTCTTCCG ACCGGGATGC 
ATTTAACTTG TCTACAAAGC TGGAATTGGA AAAGCAAGTA TTTGGTAGTA 
GGAGTCTACC ACGTGGCCCA GATAGCCAAC GCCAGGATGG AAGAGTTCAT 
GCTTCTCATC AACTAACCAG AGTCCCTGCC AAACCCATCC TACCAACAGC 
AACAGTGAGG CTACCTGAAA TGTCCACACA AAGCGCTTCC AGATACTTTG 
TAACTTCCCA GTCACCTCGT CACTGGACCA ACAAACCGGA AATAACTACA 
TATCCTTCTG GGGCTTTGCC AGAGAACAAA CAGTTTACAA CTCCAAGATT 
ATCAAGTACA ACAATTCCTC TCCCATTGCA CATGTCCAAA CCCAGCATTC 
CTAGTAAGTT TACTGACCGA AGAACTGACC AATTCAATGG TTACTCCAAA 
GTGTTTGGAA ATAACAACAT CCCTGAGGCA AGAAACCCAG TTGGAAAGCC 
TCCCAGTCCA AGAATTCCTC ATTATTCCAA TGGAAGACTC CCTTTCTTTA 
CCAACAAGAC TCTTTCTTTT CCACAGTTGG GAGTCACCCG GAGACCCCAG 
ATACCCACTT CTCCTGCCCC AGTAATGAGA GAGAGAAAAG TTATTCCAGG 
TTCCTACAAC AGGATACATT CCCATAGCAC CTTCCATCTG GACTTTGGCC 
CTCCGGCACC TCCGTTGTTG CACACTCCGC AGACCACGGG ATCACCCTCA 
ACTAACTTAC AGAATATCCC TATGGTCTCT TCCACCCAGA GTTCTATCTC 
CTTTATAACA TCTTCTGTCC AGTCCTCAGG AAGCTTCCAC CAGAGCAGCT 
CAAAGTTCTT TGCAGGAGGA CCTCCTGCAT CCAAATTCTG GTCTCTTGGG 
GAAAAGCCCC AAATCCTCAC CAAGTCCCCA CAGACTGTGT CCGTCACCGC 
TGAGACAGAC ACTGTGTTCC CCTGTGAGGC AACAGGAAAA CCAAAGCCTT 
TCGTTACTTG GACAAAGGTT TCCACAGGAG CTCTTATGAC TCCGAATACC 
AGGATACAAC GGTTTGAGGT TCTCAAGAAC GGTACCTTAG TGATACGGAA 
GGTTCAAGTA CAAGATCGAG GCCAGTATAT GTGCACCGCC AGCAACCTGC 
ACGGCCTGGA CAGGATGGTG GTCTTGCTTT CGGTCACCGT GCAGCAACCT 
CAAATCCTAG CCTCCCACTA CCAGGACGTC ACTGTCTACC TGGGAGACAC 
CATTGCAATG GAGTGTCTGG CCAAAGGGAC CCCAGCCCCC CAAATTTCCT
GGATCTTCCC TGACAGGAGG GTGTGGCAAA CTGTGTCCCC 
CGTGGAGAGC CGCATCACCC TGCACGAAAA CCGGACCCTT TCCATCAAGG
AGGCGTCCTT CTCAGACAGA GGCGTCTATA AGTGCGTGGC CAGCAATGCA 
GCCGGGGCGG ACAGCCTGGC CATCCGCCTG CACGTGGCGG CACTGCCCCC 
CGTTATCCAC CAGGAGAAGC TGGAGAACAT CTCGCTGCCC CCGGGGCTCA 
GCATTCACAT TCACTGCACT GCCAAGGCTG CGCCCCTGCC CAGCGTGCGC
TGGGTGCTCG GGGACGGTAC CCAGATCCGC CCCTCGCAGT TCCTCCACGG 
GAACTTGTTT GTTTTCCCCA ACGGGACGCT CTACATCCGC AACCTCGCGC 
CCAAGGACAG CGGGCGCTAT GAGTGCGTGG CCGCCAACCT GGTAGGCTCC 
GCGCGCAGGA CGGTGCAGCT GAACGTGCAG CGTGCAGCAG CCAACGCGCG 
CATCACGGGC ACCTCCCCGC GGAGGACGGA CGTCAGGTAC GGAGGAACCC 
TCAAGCTGGA CTGCAGCGCC TCGGGGGACC CCTGGCCGCG CATCCTCTGG 
AGGCTGCCGT CCAAGAGGAT GATCGACGCG CTCTTCAGTT TTGATAGCAG 
AATCAAGGTG TTTGCCAATG GGACCCTGGT GGTGAAATCA GTGACGGACA
AAGATGCCGG AGATTACCTG TGCGTAGCTC GAAATAAGGT TGGTGATGAC
TACGTGGTGC TCAAAGTGGA TGTGGTGATG AAACCGGCCA AGATTGAACA
CAAGGAGGAG AACGACCACA AAGTCTTCTA CGGGGGTGAC CTGAAAGTGG
ACTGTGTGGC CACCGGGCTT CCCAATCCCG AGATCTCCTG GAGCCTCCCA
GACGGGAGTC TGGTGAACTC CTTCATGCAG TCGGATGACA GCGGTGGACG
CACCAAGCGC TATGTCGTCT TCAACAATGG GACACTCTAC TTTAACGAAG
TGGGGATGAG GGAGGAAGGA GACTACACCT GCTTTGCTGA AAATCAGGTC
GGGAAGGACG AGATGAGAGT CAGAGTCAAG GTGGTGACAG CGCCCGCCAC
CATCCGGAAC AAGACTTACT TGGCGGTTCA GGTGCCCTAT GGAGACGTGG
TCACTGTAGC CTGTGAGGCC AAAGGAGAAC CCATGCCCAA GGTGACTTGG
TTGTCCCCAA CCAACAAGGT GATCCCCACC TCCTCTGAGA AGTATCAGAT
ATACCAAGAT GGCACTCTCC TTATTCAGAA AGCCCAGCGT TCTGACAGCG
GCAACTACAC CTGCCTGGTC AGGAACAGCG CGGGAGAGGA TAGGAAGACG
GTGTGGATTC ACGTCAACGT CCAGCCACCC AAGATCAACG GTAACCCCAA
CCCCATCACC ACCGTGCGGG AGATAGCAGC CGGGGGCAGT CGGAAACTGA 
TTGACTGCAA AGCTGAAGGC ATCCCCACCC CGAGGGTGTT ATGGGCTTTT 
CCCGAGGGTG TGGTTCTGCC AGCTCCATAC TATGGAAACC GGATCACTGT 
CCATGGCAAC GGTTCCCTGG ACATCAGGAG TTTGAGGAAG AGCGACTCCG 
TCCAGCTGGT ATGCATGGCA CGCAACGAGG GAGGGGAGGC GAGGTTGATC 
GTGCAGCTCA CTGTCCTGGA GCCCATGGAG AAACCCATCT TCCACGACCC 
GATCAGCGAG AAGATCACGG CCATGGCGGG CCACACCATC AGCCTCAACT 
GCTCTGCCGC GGGGACCCCG ACACCCAGCC TGGTGTGGGT CCTTCCCAAT 
GGCACCGATC TGCAGAGTGG ACAGCAGCTG CAGCGCTTCT ACCACAAGGC 
TGACGGCATG CTACACATTA GCGGTCTCTC CTCGGTGGAC GCTGGGGCCT 
ACCGCTGCGT GGCCCGCAAT GCCGCTGGCC ACACGGAGAG GCTGGTCTCC 
CTGAAGGTGG GACTGAAGCC AGAAGCAAAC AAGCAGTATC ATAACCTGGT 
CAGCATCATC AATGGTGAGA CCCTGAAGCT CCCCTGCACC CCTCCCGGGG 
CTGGGCAGGG ACGTTTCTCC TGGACGCTCC CCAATGGCAT GCATCTGGAG 



GGCCCCCAAA CCCTGGGACG CGTTTCTCTT CTGGACAATG GCACCCTCAC 
GGTTCGTGAG GCCTCGGTGT TTGACAGGGG TACCTATGTA TGCAGGATGG 
AGACGGAGTA CGGCCCTTCG GTCACCAGCA TCCCCGTGAT TGTGATCGCC 
TATCCTCCCC GGATCACCAG CGAGCCCACC CCGGTCATCT ACACCCGGCC 
CGGGAACACC GTGAAACTGA ACTGCATGGC TATGGGGATT CCCAAAGCTG 
ACATCACGTG GGAGTTACCG GATAAGTCGC ATCTGAAGGC AGGGGTTCAG 
GCTCGTCTGT ATGGAAACAG ATTTCTTCAC CCCCAGGGAT CACTGACCAT 
CCAGCATGCC ACACAGAGAG ATGCCGGCTT CTACAAGTGC ATGGCAAAAA 
ACATTCTCGG CAGTGACTCC AAAACAACTT ACATCCACGT CTTCTGAAAT 
GTGGATTCCA GAATGATTGC TTAGGAACTG ACAACAAAGC GGGGTTTGTA 
AGGGAAGCCA GGTTGGGGAA TAGGAGCTCT TAAATAATGT GTCACAGTGC 
ATGGTGGCCT CTGGTGGGTT TCAAGTTGAG GTTGATCTTG ATCTACAATT 
GTTGGGAAAA GGAAGCAATG CAGACACGAG AAGGAGGGCT CAGCCTTGCT 
GAGACACTTT CTTTTGTGTT TACATCATGC CAGGGGCTTC ATTCAGGGTG 
TCTGTGCTCT GACTGCAATT TTTCTTCTTT TGCAAATGCC ACTCGACTGC 
CTTCATAAGC GTCCATAGGA TATCTGAGGA ACATTCATCA AAAATAAGCC 
ATAGACATGA ACAACACCTC ACTACCCCAT TGAAGACGCA TCACCTAGTT 
AACCTGCTGC AGTTTTTACA TGATAGACTT TGTTCCAGAT TGACAAGTCA 
TCTTTCAGTT ATTTCCTCTG TCACTTCAAA ACTCCAGCTT GCCCAATAAG 
GATTTAGAAC CAGAGTGACT GATATATATA TATATATTTT AATTCAGAGT 
TACATACATA CAGCTACCAT TTTATATGAA AAAAGAAAAA CATTTCTTCC 
TGGAACTCAC TTTTTATATA ATGTTTTATA TATATATTTT TTCCTTTCAA 
ATCAGACGAT GAGACTAGAA GGAGAAATAC TTTCTGTCTT ATTAAAATTA 
ATAAATTATT GGTCTTTACA AGACTTGGAT ACATTACAGC AGACATGGAA 
ATATAATTTT AAAAAATTTC TCTCCAACCT CCTTCAAATT CAGTCACCAC 
TGTTATATTA CCTTCTCCAG GAACCCTCCA GTGGGGAAGG CTGCGATATT 
AGATTTCCTT GTATGCAAAG TTTTTGTTGA AAGCTGTGCT CAGAGGAGGT 
GAGAGGAGAG GAAGGAGAAA ACTGCATCAT AACTTTACAG AATTGAATCT 
AGAGTCTTCC CCGAAAAGCC CAGAAACTTC TCTGCAGTAT CTGGCTTGTC
CATCTGGTCT AAGGTGGCTG CTTCTTCCCC AGCCATGAGT CAGTTTGTGC 
CCATGAATAA TACACGACCT GTTATTTCCA TGACTGCTTT ACTGTATTTT 
TAAGGTCAAT ATACTGTACA TTTGATAATA AAATAATATT CTCCCAAAAA AAAAA 
Figure 53



Human OCP: nucleotide sequence of ORF



ATGAAGGTAAAAGGCAGAGGAATCACCTGCTTGCTGGTCTCCTTTGCTGTGATCTGC 

CTGGTCGCCACCCCTGGGGGCAAGGCCTGTCCTCGCCGCTGTGCCTGTTATATGCCT 

ACGGAGGTACACTGCACATTTCGGTACCTGACTTCCATCCCAGACAGCATCCCGCCC 

AATGTGGAACGCATCAATTTAGGATACAACAGCTTGGTTAGATTGATGGAAACAGAT 

TTTTCTGGCCTGACCAAACTGGAGTTACTCATGCTTCACAGCAATGGCATTCACACA 

ATCCCTGACAAGACCTTCTCAGATTTGCAGGCCTTGCAGGTCTTAAAAATGAGCTAT 

AATAAAGTCCGAAAACTTCAGAAAGATACTTTTTATGGCCTCAGGAGCTTGACACGA 

TTGCACATGGACCACAACAATATTGAGTTTATAAACCCAGAGGTTTTTTATGGGCTC 

AACTTTCTCCGCCTGGTGCACTTGGAAGGAAATCAGCTCACTAAGCTCCACCCAGAT 

ACATTTGTCTCTTTGAGCTACCTCCAGATATTTAAAATCTCTTTCATTAAGTTCCTA 

TACTTGTCTGATAACTTCCTGACCTCCCTCCCTCAAGAGATGGTCTCCTATATGCCT 

GACCTAGACAGCCTTTACCTGCATGGAAACCCATGGACCTGTGATTGCCATTTAAAG

TGGTTGTCTGACTGGATACAGGAGAAGCCAGATGTAATAAAATGCAAAAAAGATAGA 

AGTCCCTCTAGTGCTCAGCAGTGTCCACTTTGCATGAACCCTAGGACTTCTAAAGGC 

AAGCCGTTAGCTATGGTCTCAGCTGCAGCTTTCCAGTGTGCCAAGCCAACCATTGAC 

TCATCCCTGAAATCAAAGAGCCTGACTATTCTGGAAGACAGTAGTTCTGCTTTCATC 

TCTCCCCAAGGTTTCATGGCACCCTTTGGCTCCCTCACTTTGAATATGACAGATCAG 

TCTGGAAATGAAGCTAACATGGTCTGCAGTATTCAAAAGCCCTCAAGGACATCACCC 

ATTGCATTCACTGAAGAAAATGACTACATCGTGCTAAATACTTCATTTTCAACATTT 

TTGGTGTGCAACATAGATTACGGTCACATTCAGCCAGTGTGGCAAATTTTGGCTTTG 

TACAGTGATTCTCCTCTGATACTAGAAAGGAGCCACTTGCTTAGTGAAACACCGCAG 

CTCTATTACAAATATAAACAGGTGGCTCCTAAGCCTGAAGACATTTTTACCAACATA 

GAGGCAGATCTCAGAGCAGATCCCTCTTGGTTAATGCAAGACCAAATTTCCTTGCAG 

CTGAACAGAACTGCCACCACATTCAGTACATTACAGATCCAGTACTCCAGTGATGCT 

CAAATCACTTTACCAAGAGCAGAGATGAGGCCAGTGAAACACAAATGGACTATGATT 

TCAAGGGATAACAATACTAAGCTGGAACATACTGTCTTGGTAGGTGGAACCGTTGGC 

CTGAACTGCCCAGGCCAAGGAGACCCCACCCCACACGTGGATTGGCTTCTAGCTGAT 

GGAAGTAAAGTGAGAGCCCCTTATGTCAGTGAGGATGGACGGATCCTAATAGACAAA 

AGTGGAAAATTGGAACTCCAGATGGCTGATAGTTTTGACACAGGCGTATATCACTGT 

ATAAGCAGCAATTATGATGATGCAGATATTCTCACCTATAGGATAACTGTGGTAGAA 

CCTTTGGTCGAAGCCTATCAGGAAAATGGGATTCATCACACAGTTTTCATTGGTGAA 

ACACTTGATCTTCCATGCCATTCTACTGGTATCCCAGATGCCTCTATTAGCTGGGTT 

ATTCCAGGAAACAATGTGCTCTATCAGTCATCAAGAGACAAGAAAGTTCTAAACAAT 

GGCACATTAAGAATATTACAGGTCACCCCGAAAGACCAAGGTTATTATCGCTGTGTG 

GCAGCCAACCCATCAGGGGTTGATTTTTTGATTTTCCAAGTTTCAGTCAAGATGAAA 

GGACAAAGGCCCTTGGAGCATGATGGAGAAACAGAGGGATCTGGACTTGATGAGTCC 

AATCCTATTGCTCATCTTAAGGAGCCACCAGGTGCACAACTCCGTACATCTGCTCTG 

ATGGAGGCTGAGGTTGGAAAACACACCTCAAGCACAAGTAAGAGGCACAACTATCGG 

GAATTAACACTCCAGCGACGTGGAGATTCAACACATCGACGTTTTAGGGAGAATAGG 

AGGCATTTCCCTCCCTCTGCTAGGAGAATTGACCCACAACATTGGGCGGCACTGTTG 

GAGAAAGCTAAAAAGAATGCTATGCCAGACAAGCGAGAAAATACCACAGTGAGCCCA 

CCCCCAGTGGTCACCCAACTCCCAAACATACCTGGTGAAGAAGACGATTCCTCAGGC 

ATGCTCGCTCTACATGAGGAATTTATGGTCCCGGCCACTAAAGCTTTGAACCTTCCA 

GCAAGGACAGTGACTGCTGACTCCAGAACAATATCTGATAGTCCTATGACAAACATA 

AATTATGGCACAGAATTCTCTCCTGTTGTGAATTCACAAATACTACCACCTGAAGAA 

CCCACAGATTTCAAACTGTCTACTGCTATTAAAACTACAGCCATGTCAAAGAATATA 
AACCCAACCATGTCAAGCCAAATACAAGGCACAACCAATCAACATTCATCCACTGTC
TTTCCACTGCTACTTGGAGCAACTGAATTTCAGGACTCTGACCAGATGGGAAGAGGA 
AGAGAGCATTTCCAAAGTAGACCCCCAATAACAGTAAGGACTATGATCAAAGATGTC 
AATGTCAAAATGCTTAGTAGCACCACCAACAAACTATTATTAGAGTCAGTAAATACC
ACAAATAGTCATCAGACATCTGTAAGAGAAGTGAGTGAACCCAGGCACAATCACTTC 
TATTCTCACACTACTCAAATACTTAGCACCTCCACGTTCCCTTCAGATCCACACACA 
GCTGCTCATTCTCAGTTTCCGATCCCTAGAAATAGTACAGTTAACATCCCGCTGTTC 
AGACGCTTTGGGAGGCAGAGGAAAATTGGCGGAAGGGGGCGGATTATCAGCCCATAT 
AGAACTCCAGTTCTGCGACGGCATAGATACAGCATTTTCAGGTCAACAACCAGAGGT 
TCTTCTGAAAAAAGCACTACTGCATTCTCAGCCACAGTGCTCAATGTGACATGTCTG
TCCTGTCTTCCCAGGGAGAGGCTCACCACTGCCACAGCAGCATTGTCTTTTCCAAGT 
GCTGCTCCCATCACCTTCCCCAAAGCTGACATTGCTAGAGTCCCATCAGAAGAGTCT 
ACAACTCTAGTCCAGAATCCACTATTACTACTTGAGAACAAACCCAGTGTAGAGAAA 
ACAACACCCACAATAAAATATTTCAGGACTGAAATTTCCCAAGTGACTCCAACTGGT 
GCAGTCATGACATATGCTCCAACATCCATACCCATGGAAAAAACTCACAAAGTAAAC 
GCCAGTTACCCACGTGTGTCTAGCACCAATGAAGCTAAAAGAGATTCAGTGATTACA 
TCGTCACTTTCAGGTGCTATCACCAAGCCACCAATGACTATTATAGCCATTACAAGG 
TTTTCAAGAAGGAAAATTCCCTGGCAACAGAACTTTGTAAATAACGATAACCCAAAA 

ggcagattaaggaatcaacataaagttagtttacaaaaaagcacagctgtgatgctt
cctaaaacatctcctgctttaccacagagacaaagttcccctttccatttcaccaca
ctttcaacaagtgtgatgcaaattccatctaataccttgactaccgctcaccacact 
acgaccaaaacacacaatcctggaagtcttccaacaaagaaggagcttcccttccca 
ccccttaaccctatgcttcctagtattataagcaaagactcaagtacaaaaagcatc 
atatcaacgcaaacagcaataccagcaacaactcctaccttccctgcatctgtcatc 
acttatgaaacccaaacagagagatctagagcacaaacaatacaaagagaacaggag 
cctcaaaagaagaacaggactgacccaaacatctctccagaccagagttctggcttc 

ACTACACCCACTGCTATGACACCTCCTGCTCTGGCATTCACTCATTCCCCACCAGAA 
AACACAACTGGCATTTCAAGCACAATCAGTTTTCATTCAAGAACTCTTAATCTGACA 
GATGTGATTGAAGAACTAGCCCAAGCAAGTACTCAGACTTTGAAGAGCACAATTGCT 
TCTGAAACAACTTTGTCCAGCAAATCACACCAGAGTACCACAACTAGGAAAGCATCA 
TTAGACACTCCCATACCACCATTCTTGAGCAGCAGTGCTACTCTAATGCCAGTTCCC 
ATCTCCCCTCCCTTTACTCAGAGAGCAGTTACTGACACACGTGGCGACTCCCATTTC 
CGGCTTATGACAAATACAGTGGTCAAGCTGCACGAATCCTCAAGGCACAATCTCCAA
ATGCCAAGTTCACAATTGGAACCACTCACTTCATCTACCTCTAATCTGTTACATTCT 
ACTCCCATGCCAGCACTAACAACAGTTAAATCACAGAATTCCAAATTAACTCCATCT 
CCCTGGGCAGAATACCAATTTTGGCACAAACCATACTCAGACATTGCTGAAAAAGGC 
AAAAAGCCAGAAGTAAGCATGTTGGCTACTACAGGCCTGTCCGAGGCCACCACTCTT 
GTTTCAGATTGGGATGGACAGAAGAACACAAAGAAGAGTGACTTTGATAAGAAACCA 
GTTCAAGAAGCAACAACTTCCAAACTCCTTCCCTTTGACTCTTTGTCTAGGTATATA 
TTTGAAAAGCCCAGGATAGTTGGAGGAAAAGCTGCAAGTTTTACTATTCCAGCTAAC
TCAGATGCCTTTCTTCCCTGTGAAGCTGTTGGAAATCCCCTGCCCACCATTCATTGG 
ACCAGAGTTTCAGGACTTGATTTATCTAGAGGAAACCAGAATAGCAGGGTCCAGGTT 
CTCCCCAATGGTACCCTGTCCATCCAGAGGGTGGAAATTCAGGACCGCGGACAGTAC 
TTGTGTTCCGCATCCAATCTGTTTGGCACAGACCACCTTCATGTCACCTTGTCTGTG 

gtttcctatcctcccaggatcctggagagacgtaccaaagagatcacagttcattcc

GGAAGCACTGTGGAACTGAAGTGCAGAGCAGAAGGTAGGCCAAGCCCTACAGTTACC 
TGGATTCTTGCAAACCAAACAGTTGTCTCAGAATCATCCCAGGGAAGTAGGCAGGCT 
GTGGTGACGGTTGACGGAACATTGGTCCTCCACAATCTCAGTATTTATGACCGTGGC 
TTTTACAAATGTGTGGCCAGCAACCCAGGTGGCCAGGATTCACTGCTGGTTAAAATA 
CAAGTCATTGCAGCACCACCTGTTATTCTAGAGCAAAGGAGGCAAGTCATTGTAGGC 
ACTTGGGGTGAAAGTTTAAAACTGCCCTGTACTGCAAAAGGAACTCCTCAGCCCAGC 
gtttactgggtcctctctgatggcactgaagtgaaaccattacagtttaccaattcc

aagttgttcttattttcaaatgggactttgtatataagaaacctagcctct^'gagac 

aggggcacttatgaatgcattgctaccagttccactggttcggagcgaagagtagta 

atgcttacaatggaagagcgagtgaccagccccaggatagaagctgcatcccagaaa 

aggactgaagtgaattttggggacaaattactactgaactgctcagccactggggag 

cccaaaccccaaataatgtggaggttaccatccaaggctgtggtcgaccagtggagc 

TGGATCCACGTCTACCCTAATGGATCCCTGTTTATTGGATCAGTAACAGAAAAAGAC 
AGTGGTGTCTACTTGTGTGTGGCAAGAAACAAAATGGGGGATGATCTGATACTGATG 
CATGTTAGCCTAAGACTGAAACCTGCCAAAATTGACCACAAGCAGTATTTTAGAAAG 
CAAGTGCTCCATGGGAAAGATTTCCAAGTAGATTGCAAAGCTTCCGGCTCCCCAGTG 
CCAGAGATATCTTGGAGTTTGCCTGATGGAACCATGATCAACAATGCAATGCAAGCC 
GATGACAGTGGCCACAGGACTAGGAGATATACCCTTTTCAACAATGGAACTTTATAC 
TTCAACAAAGTTGGGGTAGCGGAGGAAGGAGATTATACTTGCTATGCCCAGAACACC 
CTAGGGAAAGATGAAATGAAGGTCCACTTAACAGTTATAACAGCTGCTCCCCGGATA 
AGGCAGAGTAACAAAACCAACAAGAGAATCAAAGCTGGAGACACAGCTGTCCTTGAC 
TGTGAGGTCACTGGGGATCCCAAACCAAAAATATTTTGGTTGCTGCCTTCCAATGAC 
ATGATTTCCTTCTCCATTGATAGGTACACATTTCATGCCAATGGGTCTTTGACCATC 
AACAAAGTGAAACTGCTCGATTCTGGAGAGTACGTATGTGTAGCCCGAAATCCCAGT 
GGGGATGACACCAAAATGTACAAACTGGATGTGGTCTCTAAACCTCCATTAATCAAT 
GGTCTGTATACAAACAGAACTGTTATTAAAGCCAGAGCTGTGAGACATTCCAAAAAA 
CACTTTGACTGCAGAGCTGAAGGGACACCATCTCCTGAAGTCATGTGGATCATGCCA 
GACAATATTTTCCTCACAGCCCCATACTATGGAAGCAGAATCACAGTCCATAAAAAT 
GGAACCTTGGAAATTAGGAATGTGAGGCTTTCAGATTCAGCCGACTTTATCTGTGTG 
GCCCGAAATGAAGGTGGAGAGAGCGTGTTGGTAGTACAGTTAGAAGTACTGGAAATG 
CTGAGAAGACCGACATTTAGAAATCCATTTAATGAAAAAATAGTTGCCCAGCTGGGA 
AAGTCCACAGCATTGAATTGCTCTGTTGATGGTAACCCACCACCTGAAATAATCTGG 
ATTTTACCAAATGGCACACGATTTTCCAATGGACCACAAAGTTATCAGTATCTGATA 
GCAAGCAATGGTTCTTTTATCATTTCTAAAACAACTCGGGAGGATGCAGGAAAATAT 
CGCTGTGCAGCTAGGAATAAAGTTGGCTATATTGAGAAATTAGTCATATTAGAAATT 
GGCCAGAAGCCAGTTATTCTTACCTATGCACCAGGGACAGTAAAAGGCATCAGTGGA 
GAATCTCTATCACTGCATTGTGTGTCTGATGGAATCCCTAAGCCAAATATCAAATGG 
ACTATGCCAAGTGGTTATGTAGTAGACAGGCCTCAAATTAATGGGAAATACATATTG 
CATGACAATGGCACCTTAGTCATTAAAGAAGCAACAGCTTATGACAGAGGAAACTAT 
ATCTGTAAGGCTCAAAATAGTGTTGGTCATACACTGATTACTGTTCCAGTAATGATT 
GTAGCCTACCCTCCCCGAATTACAAATCGTCCACCCAGGAGTATTGTCACCAGGACA 
GGGGCAGCCTTTCAGCTCCACTGTGTGGCCTTGGGAGTTCCCAAGCCAGAAATCACA 
TGGGAGATGCCTGACCACTCCCTTCTCTCAACGGCAAGTAAAGAGAGGACACATGGA 
AGTGAGCAGCTTCACTTACAAGGTACCCTAGTCATTCAGAATCCCCAAACCTCCGAT 
TCTGGGATATACAAATGCACAGCAAAGAACCCACTTGGTAGTGATTATGCAGCAACG 
TATATTCAAGTAATCTGA 
Figure 54

Human OCP: predicted amino acid sequence



MKVKGRGITC LLVSFAVICL VATPGGKACP RRCACYMPTE VHCTFRYLTS 
IPDSIPPNVE RINLGYNSLV RLMETDFSGL TKLELLMLHS NGIHTIPDKT 
FSDLQALQVL KMSYNKVRKL QKDTFYGLRS LTRLHMDHNN IEFINPEVFY 
GLNFLRLVHL EGNQLTKLHP DTFVSLSYLQ IFKISFIKFL YLSDNFLTSL 
PQEMVSYMPD LDSLYLHGNP WTCDCHLKWL SDWIQEKPDV IKCKKDRSPS 
SAQQCPLCMN PRTSKGKPLA MVSAAAFQCA KPTIDSSLKS KSLTILEDSS 
SAFISPQGFM APFGSLTLNM TDQSGNEANM VCSIQKPSRT SPIAFTEEND 
YIVLNTSFST FLVCNIDYGH IQPVWQILAL YSDSPLILER SHLLSETPQL 
YYKYKQVAPK PEDIFTNIEA DLRADPSWLM QDQISLQLNR TATTFSTLQI 
QYSSDAQITL PRAEMRPVKH KWTMISRDNN TKLEHTVLVG GTVGLNCPGQ 
GDPTPHVDWL LADGSKVRAP YVSEDGRILI DKSGKLELQM ADSFDTGVYH 
CISSNYDDAD ILTYRITVVE PLVEAYQENG IHHTVFIGET LDLPCHSTGI
PDASISWVIP GNNVLYQSSR DKKVLNNGTL RILQVTPKDQ GYYRCVAANP 
SGVDFLIFQV SVKMKGQRPL EHDGETEGSG LDESNPIAHL KEPPGAQLRT 
SALMEAEVGK HTSSTSKRHN YRELTLQRRG DSTHRRFREN RRHFPPSARR 
IDPQHWAALL EKAKKNAMPD KRENTTVSPP PVVTQLPNIP GEEDDSSGML
ALHEEFMVPA TKALNLPART VTADSRTISD SPMTNINYGT EFSPVVNSQI
LPPEEPTDFK LSTAIKTTAM SKNINPTMSS QIQGTTNQHS STVFPLLLGA 
TEFQDSDQMG RGREHFQSRP PITVRTMIKD VNVKMLSSTT NKLLLESVNT 
TNSHQTSVRE VSEPRHNHFY SHTTQILSTS TFPSDPHTAA HSQFPIPRNS 
TVNIPLFRRF GRQRKIGGRG RIISPYRTPV LRRHRYSIFR STTRGSSEKS 
TTAFSATVLN VTCLSCLPRE RLTTATAALS FPSAAPITFP KADIARVPSE 
ESTTLVQNPL LLLENKPSVE KTTPTIKYFR TEISQVTPTG AVMTYAPTSI 
PMEKTHKVNA SYPRVSSTNE AKRDSVITSS LSGAITKPPM TIIAITRFSR 
RKIPWQQNFV NNHNPKGRLR NQHKVSLQKS TAVMLPKTSP ALPQRQSSPF 
HFTTLSTSVM QIPSNTLTTA HHTTTKTHNP GSLPTKKELP FPPLNPMLPS 
IISKDSSTKS IISTQTAIPA TTPTFPASVI TYETQTERSR AQTIQREQEP 
QKKNRTDPNI SPDQSSGFTT PTAMTPPALA FTHSPPENTT GISSTISFHS 
RTLNLTDVIE ELAQASTQTL KSTIASETTL SSKSHQSTTT RKASLDTPIP 
PFLSSSATLM PVPISPPFTQ RAVTDTRGDS HFRLMTNTVV KLHESSRHNL 
QMPSSQLEPL TSSTSNLLHS TPMPALTTVK SQNSKLTPSP WAEYQFWHKP 
YSDIAEKGKK PEVSMLATTG LSEATTLVSD WDGQKNTKKS DFDKKPVQEA 
TTSKLLPFDS LSRYIFEKPR IVGGKAASFT IPANSDAFLP CEAVGNPLPT 
IHWTRVSGLD LSRGNQNSRV QVLPNGTLSI QRVEIQDRGQ YLCSASNLFG 
TDHLHVTLSV VSYPPRILER RTKEITVHSG STVELKCRAE GRPSPTVTWI 
LANQTVVSES SQGSRQAVVT VDGTLVLHNL SIYDRGFYKC VASNPGGQDS 
LLVKIQVIAA PPVILEQRRQ VIVGTWGESL KLPCTAKGTP QPSVYWVLSD 
GTEVKPLQFT NSKLFLFSNG TLYIRNLASS DRGTYECIAT SSTGSERRVV 
MLTMEERVTS PRIEAASQKR TEVNFGDKLL LNCSATGEPK PQIMWRLPSK 
AVVDQWSWIH VYPNGSLFIG SVTEKDSGVY LCVARNKMGD DLILMHVSLR 
LKPAKIDHKQ YFRKQVLHGK DFQVDCKASG SPVPEISWSL PDGTMINNAM 
QADDSGHRTR RYTLFNNGTL YFNKVGVAEE GDYTCYAQNT LGKDEMKVHL 
TVITAAPRIR QSNKTNKRIK AGDTAVLDCE VTGDPKPKIF WLLPSNDMIS 
FSIDRYTFHA NGSLTINKVK LLDSGEYVCV ARNPSGDDTK MYKLDVVSKP 
PLTNGLYTNR TVIKATAVRH SKKHFDCRAE GTPSPEVMWI MPDNIFLTAP
YYGSRITVHK NGTLEIRNVR LSDSADFICV ARNEGGESVL VVQLEVLEML 
RRPTFRNPFN EKIVAQLGKS TALNCSVDGN PPPEIIWILP NGTRFSNGPQ 
SYQYLIASNG SFIISKTTRE DAGKYRCAAR NKVGYIEKLV ILEIGQKPVI 
LTYAPGTVKG ISGESLSLHC VSDGIPKPNI KWTMPSGYVV DRPQINGKYI
LHDNGTLVIK EATAYDRGNY ICKAQNSVGH TLITVPVMIV AYPPRITNRP 
PRSIVTRTGA AFQLHCVALG VPKPEITWEM PDHSLLSTAS KERTHGSEQL 
HLQGTLVIQN PQTSDSGIYK CTAKNPLGSD YAATYIQVI* 